
That does not sound like a safe assumption to me at all. If someone will pay for that data, I am quite sure it is happening.



It’s especially easy to imagine this happening implicitly: feed everything into an ML system, and if it tags a clip with something like “(sex noises)” or “(moaning)” (labels that Hollywood subtitles and other sources in someone’s training data probably contain), that clip becomes searchable without anyone explicitly setting out to build such a system.


I'd assume that, if companies are actively trying to monetize data that way, they are more likely to get valuable data from people talking about their sex lives.


Oh, sure. The point is just that nobody has to set a business goal of building that feature for a general-purpose classifier to expose it.
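
To make that concrete, here is a rough sketch of how it could fall out by accident. Everything in it is made up for illustration: `tag_clip` is a hypothetical stand-in for whatever general-purpose tagger a company might run (here it only pulls parenthesized sound tags out of a caption-style string), and the clip IDs and transcripts are invented. Nothing in the code names a specific tag, yet every tag the tagger emits ends up searchable.

```python
import re
from collections import defaultdict


def tag_clip(transcript):
    """Hypothetical stand-in for a general-purpose audio tagger.

    Assumption: the tagger emits caption-style labels in parentheses,
    e.g. "(laughter)". A real system would run a model here.
    """
    return re.findall(r"\(([^)]+)\)", transcript)


def build_index(clips):
    """Build an inverted index: tag -> set of clip IDs carrying that tag."""
    index = defaultdict(set)
    for clip_id, transcript in clips.items():
        for tag in tag_clip(transcript):
            index[tag.lower()].add(clip_id)
    return index


# Invented sample data, for illustration only.
clips = {
    "clip-1": "(door slams) Hello? (laughter)",
    "clip-2": "(dog barking) Quiet! (laughter)",
}

index = build_index(clips)
matches = sorted(index["laughter"])  # lookup works for any tag the tagger produced
```

The index is generic, so whatever labels the tagger learned from its training data become query terms with no extra work from anyone.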



