Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I saw Musk saying a couple of days ago that we've "hit the limit of peak data" for training AI. My immediate reaction was no, surely you have not trained on every copyrighted textbook on every subject ever written. You hit the peak of easily accessible internet data that you could quickly steal to train your models.



Meta famously used libgen to train, right? That is basically a source for all copyrighted textbooks and more.


You might not know it, but there is no data for AI in robotics.

Everyone has to collect their own data and pool it together or else there won't be any progress.


The 82TB Meta trained on is still a lot of textbooks.


I can’t help but think that’s the real reasons he wants five billets from every federal worker every week. Free, hot, and fresh data!


Eventually humans ability to create new fresh data will be the justification for UBI. Fo shizzle




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: