Web Scraping with C++ (scrapingbee.com)
20 points by daolf on Sept 22, 2022 | 5 comments



I was playing with gumbo several months ago and made a nicer C++ wrapper that allowed for some interesting ways of doing queries. It was fun playing with combining the predicates and optimizing the result type so that it did less work. https://github.com/beached/gumbo_pp
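
To give a flavor of what combining predicates can look like, here is a minimal sketch. It is not gumbo_pp's actual API and it does not touch gumbo at all: `Node`, `Pred`, `has_tag`, `has_class`, `filter_nodes`, and `count_content_links` are hypothetical names made up for illustration, and the "document" is just a flat vector.

```cpp
#include <algorithm>
#include <cstddef>
#include <iterator>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a parsed HTML element; not a gumbo or gumbo_pp type.
struct Node {
    std::string tag;
    std::string cls;
};

// Thin wrapper so the operators below only apply to our own predicates.
template <typename F>
struct Pred {
    F fn;
    bool operator()(const Node& n) const { return fn(n); }
};

template <typename F>
Pred<F> make_pred(F f) { return Pred<F>{std::move(f)}; }

// Leaf predicates (hypothetical names).
inline auto has_tag(std::string tag) {
    return make_pred([tag = std::move(tag)](const Node& n) { return n.tag == tag; });
}

inline auto has_class(std::string cls) {
    return make_pred([cls = std::move(cls)](const Node& n) { return n.cls == cls; });
}

// Combinators: each result is itself a predicate, so they nest freely.
template <typename A, typename B>
auto operator&&(Pred<A> a, Pred<B> b) {
    return make_pred([a = std::move(a), b = std::move(b)](const Node& n) {
        return a(n) && b(n);
    });
}

template <typename A, typename B>
auto operator||(Pred<A> a, Pred<B> b) {
    return make_pred([a = std::move(a), b = std::move(b)](const Node& n) {
        return a(n) || b(n);
    });
}

template <typename A>
auto operator!(Pred<A> a) {
    return make_pred([a = std::move(a)](const Node& n) { return !a(n); });
}

// Return copies of the nodes that satisfy the predicate.
template <typename P>
std::vector<Node> filter_nodes(const std::vector<Node>& nodes, P pred) {
    std::vector<Node> out;
    std::copy_if(nodes.begin(), nodes.end(), std::back_inserter(out), pred);
    return out;
}

// Small demo: count the links that are not marked "nav".
inline std::size_t count_content_links() {
    const std::vector<Node> nodes = {
        {"a", "nav"}, {"a", "story"}, {"div", "story"}, {"a", ""}};
    const auto query = has_tag("a") && !has_class("nav");
    return filter_nodes(nodes, query).size();
}
```

Because every combinator returns a distinct type, the compiler sees the whole query as one inlinable callable rather than a chain of `std::function` calls, which is one plausible way to get the "less work" effect mentioned above.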


Very nice looking slides on your GitHub project page. What did you use to create them?


Thanks :)

It was PowerPoint and some screen captures of the relevant code blocks.


The article uses libcurl and gumbo (https://github.com/google/gumbo-parser). Gumbo is apparently written in pure C99 (interestingly, curl is written in the even older C89 standard). It would have been more amusing if the article had taken that into account and used C99.


The go-to reference for web scraping used to be weboob.



