
I wonder — when GPTBot crawls my website, which has a number of translations performed using GPT, will it use all that data for training future models? That doesn't seem like a good idea, but I don't know how they could tell.


Training models on data from another model eventually leads to model collapse.


It’s like incest.


That's actually an excellently apt analogy.



