Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This suggest a very scalable, easy approach to extract data from somewhat regular HTML...



I generally use xidel [1] for that type of task. Feed it xpath, css selectors or its own pattern matching thing.

[1] https://github.com/benibela/xidel


or just use xpath




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: