
It can make a lot of sense for web scraping. If you have lots of target sites, you can build strict extraction rules and update them constantly, hand-build something generic (often very hard), or train some classifiers for the content you want.
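
To make the classifier option concrete, here is a rough sketch of what it could look like: a tiny naive Bayes model, standard library only, that labels page blocks as "price" or "other" from coarse features rather than per-site selectors. The block format `(tag, css_class, text)`, the labels, the feature set, and the training rows are all made up for illustration; a real setup would need far more labelled data and probably a proper ML library.

```python
import math
from collections import Counter, defaultdict


def features(block):
    """Turn a (tag, css_class, text) block into coarse, site-independent features."""
    tag, css_class, text = block
    class_tokens = css_class.lower().replace("-", " ").replace("_", " ").split()
    return [
        f"tag={tag}",
        f"long={len(text.split()) >= 8}",
        f"has_currency={any(c in text for c in '$€£')}",
    ] + [f"class~{tok}" for tok in class_tokens]


class NaiveBayes:
    """Multinomial naive Bayes with add-one smoothing over string features."""

    def __init__(self):
        self.label_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()

    def fit(self, blocks, labels):
        for block, label in zip(blocks, labels):
            self.label_counts[label] += 1
            for feat in features(block):
                self.feature_counts[label][feat] += 1
                self.vocab.add(feat)
        return self

    def predict(self, block):
        total = sum(self.label_counts.values())

        def score(label):
            # Log prior plus smoothed log likelihood of each feature.
            s = math.log(self.label_counts[label] / total)
            denom = sum(self.feature_counts[label].values()) + len(self.vocab)
            for feat in features(block):
                s += math.log((self.feature_counts[label][feat] + 1) / denom)
            return s

        return max(self.label_counts, key=score)


# Hypothetical hand-labelled blocks gathered from several different sites.
TRAIN = [
    (("span", "product-price", "$19.99"), "price"),
    (("div", "price_box", "€5.00"), "price"),
    (("p", "amount", "£120"), "price"),
    (("a", "nav-link", "Home"), "other"),
    (("p", "description", "A sturdy steel kettle with a two litre capacity and lid"), "other"),
    (("div", "footer", "Contact us"), "other"),
]

model = NaiveBayes().fit([b for b, _ in TRAIN], [y for _, y in TRAIN])

# A block from a site the model has never seen, with an unfamiliar tag and class.
print(model.predict(("b", "cost", "$42.00")))
print(model.predict(("li", "menu-item", "About")))
```

The trade-off versus strict rules is that nothing here names a specific site's markup, so a new or redesigned site may still work without a rule update, at the cost of labelling data and tolerating some misclassifications.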

