Ask HN: Is HN used as an AI dataset?
3 points by jb_briant 3 days ago | 3 comments
Someone, somewhere is scraping, right?





I've turned up several of my own HN comments using FastGPT, from Kagi Labs.

Whether that's training data or live search results I'm not entirely sure, but HN definitely contributes to results in that case.


There are a couple of different APIs, and full datasets are downloadable, so the data is readily accessible without scraping anything.
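
For illustration, here is a minimal sketch of reading a single item through one such API. The comment above doesn't name the APIs, so this assumes the public Firebase-backed Hacker News API at hacker-news.firebaseio.com; the helper names item_url and fetch_item are hypothetical, made up for this example.

```python
import json
import urllib.request

# Assumption: base URL of the public Firebase-backed Hacker News API.
HN_API_BASE = "https://hacker-news.firebaseio.com/v0"


def item_url(item_id: int) -> str:
    """Build the URL for one item (story, comment, job, or poll)."""
    return f"{HN_API_BASE}/item/{item_id}.json"


def fetch_item(item_id: int, timeout: float = 10.0):
    """Fetch one item and decode its JSON body (requires network access).

    Returns a dict for an existing item, or None if the API answers `null`.
    """
    with urllib.request.urlopen(item_url(item_id), timeout=timeout) as resp:
        return json.load(resp)


# Example usage (not run here because it needs network access):
# item = fetch_item(8863)
# if item is not None:
#     print(item.get("by"), item.get("type"))
```

Anyone wanting the whole corpus would more likely grab one of the downloadable dumps than loop over item IDs like this.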

If it's on the internet, it's in a dataset somewhere at this point.


