Hacker News new | past | comments | ask | show | jobs | submit login
Ask HN: What online tool do you use for data cleaning?
2 points by martin_drapeau on Dec 20, 2018 | hide | past | favorite | 3 comments
Been looking for an online tool that's easy to use for tabular data cleaning such as date parsing and formatting, column split, etc. Couldn't find anything so I created Data Janitor (https://www.csvjson.com/datajanitor) to scratch my own itch.

Wondering what people out there use. I prefer an online tool - don't want to install anything. Also a point tool - don't want to have to sign up.




This looks like a nice ShowHN. Look for the details and format in https://news.ycombinator.com/showhn.html

Use a title like "Show HN: Data Janitor - Online spreadsheet cleaning and transformation"

(Wait for resubmitting until Dec26. Nobody reads HN during Christmas.)

Some questions/remarks:

- It's not 100% clear that you must click in Clean & Transform to edit the transformation script

- If you edit the process script but don't click on "Run", then nothing happens (obviously). Perhaps you can add a small warning in the main page "Outdated results. Please click "Run" to process your data with the new script"

- Does the process happen in the browser or in your server? How private is the data I upload? (I'm pessimistic and assume not private at all.)

- How long do you keep the saver versions? Are you doing some filtering to prevent spam/overuse/ilegal content?

(Once your site is big enough to get some spam/abuse/ilegal content, you can write a nice post about how you combat it.)


In an article, pati11 recommended SheeUS. https://www.kalzumeus.com/2015/01/28/design-and-implementati... HN discussion https://news.ycombinator.com/item?id=8960280 (107 points | Jan 28, 2015 | 35 comments)


Thanks, that really helps.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: