Thank you for the down-votes. I guess all the experts on HN know how easy it is to simulate training data because they all took the 101 course on how to rotate/resample images. That is uniquely an image classification technique.
Please, oh wise ones, how do we simulate NLP data, numeric data, finance data, biological data, and anything else machine learning is used for?
Oh, you are able to classify dogs and cats in images after a two-hour YouTube video. How nice.
Renesd is correct that "big data" is overblown. There are diminishing marginal returns: you need orders of magnitude more data for the same incremental gain (and this blows up well beyond however many millions of cars Tesla can hope to run).
You're correct that data augmentation is only a marginal technique for squeezing out more performance, and that it is not possible in many domains.