Hacker News new | past | comments | ask | show | jobs | submit login

Thank you for the down-votes. I guess all the experts on HN know how easy it is to simulate training data because they all took the 101 course on how to rotate/resample images. That is uniquely a image classification technique.

Please oh wise ones how do we simulate nlp data, numeric data, finance data, biological data and anything else machine learning is used for.

Oh you are able to classify dogs and cats in images after a 2 hour youtube. How nice.




Both you and renesd are correct.

Renesd is correct that "big data" is overblown. There are diminishing marginal returns - you need orders of magnitude more data for the same incremental gain (and this blows up well beyond however millions of cars Tesla can hope to run).

You're correct that data augmentation is only a marginal technique to squeeze out more performance, and not generally possible in many domains.


From what I've observed of member behavior on HN, I suspect that the downvotes may be a response to tone as opposed to content.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: