| ||Ask HN: What is the best tool to infer data type of tabular data?|
5 points by mahalel 3 days ago | hide | past | favorite | 7 comments |
|Hi HN, I am looking for the most accurate tool which I can use to infer Data Types from Tabular data (csv,tsv,excel)|
I need to be able to perform some small customization, if possible, to the detection algorithm. For example if I have a 9 digit number, starting with 0, then treat it as a String.
So far - I have found Frictionless Framework  which seems good, but I can't see any way of specifying customizations to the profiling algorithm, and Data Profiler  which uses ML for type detection, and it seems I should be able to train some new rules but I need a CUDA capable machine, which at the moment I do not have.
Hoping the collective HN brain can point me to something better if it exists.
 - https://framework.frictionlessdata.io/
 - https://github.com/capitalone/DataProfiler
| Apply to YC