Show HN: Handinger – API to extract data from the Internet (handinger.com)
4 points by masylum 5 months ago | 3 comments
A few months ago, I was developing a bookmark manager for https://fika.bar. I needed a way to fetch metadata, screenshots, and markdown content from websites. Since Fika is a donation-based product, I couldn't afford any of the providers out there, so I decided to build instead of buy.

"Easy," I thought. But weeks later, I found myself building data pipelines, implementing proxy rotation schemes, and dealing with TLS fingerprinting. Not fun—and definitely not how I wanted to spend my time. My product was a bookmark manager, not a data extraction service, yet most of my efforts ended up there!

It turns out that the Internet has become a difficult place to extract information from. Many websites have implemented measures against data extraction, making it more challenging and expensive than it should be to build products that rely on web data.

Handinger is my attempt to fix this. During that time, I also noticed that people training LLMs need to fetch data at scale. So, I extracted the work I'd done for Fika and built a very affordable, general-purpose scraping API. I managed to optimize it so I can price it at only 0.0005 EUR per URL—peanuts!
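For a sense of scale, here is a quick back-of-the-envelope sketch of what that per-URL price means for a crawl. Only the 0.0005 EUR figure comes from the post; the function name and the batch sizes are illustrative assumptions, and volume discounts or plan minimums (if any exist) are not modeled.

```python
from decimal import Decimal

# Price quoted in the post; everything else here is illustrative.
PRICE_PER_URL_EUR = Decimal("0.0005")


def crawl_cost_eur(url_count: int) -> Decimal:
    """Return the cost in EUR of fetching `url_count` URLs at the flat quoted price."""
    if url_count < 0:
        raise ValueError("url_count must be non-negative")
    return PRICE_PER_URL_EUR * url_count


if __name__ == "__main__":
    # Hypothetical batch sizes, chosen only to show the order of magnitude.
    for n in (1_000, 100_000, 1_000_000):
        print(f"{n:>9,} URLs -> {crawl_cost_eur(n)} EUR")
```

At the flat rate, one million URLs works out to 500 EUR.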

Try it out and let me know what you think :)




Hi, if you want to check whether our SDK fits your extraction needs, we'd be happy to discuss a solution, since you said "I couldn't afford any of the providers out there": https://github.com/docwire/docwire


Hi, Congrats on launching the product.

Curious to know how you arrived at this pricing, which looks really good. In my experience, data transfer is one of the larger costs in systems like these. It would be great if you could share some insights on that too.


I did a lot of optimizations with a tiered approach: starting with the cheapest (and egress-free) fetching methods and falling back to more expensive ones only when those fail.
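A minimal sketch of what a tiered approach like this could look like, not Handinger's actual implementation. The tier names, the relative cost units, and the stub fetchers are all hypothetical; the comment above does not say which tiers exist or how failures are detected.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence, Tuple


@dataclass(frozen=True)
class Tier:
    name: str                              # hypothetical tier label
    cost_units: int                        # hypothetical relative cost per attempt
    fetch: Callable[[str], Optional[str]]  # returns the page body, or None on failure


def fetch_with_fallback(url: str, tiers: Sequence[Tier]) -> Tuple[str, str, int]:
    """Try tiers from cheapest to most expensive; return (tier name, body, units spent)."""
    spent = 0
    for tier in sorted(tiers, key=lambda t: t.cost_units):
        spent += tier.cost_units
        try:
            body = tier.fetch(url)
        except Exception:
            body = None  # treat any error as "this tier failed, escalate"
        if body is not None:
            return tier.name, body, spent
    raise RuntimeError(f"all tiers failed for {url}")


# Stub fetchers standing in for real strategies (no network access here).
def plain_http(url: str) -> Optional[str]:
    return None if "blocked" in url else "<html>plain</html>"


def headless_browser(url: str) -> Optional[str]:
    return "<html>rendered</html>"


TIERS = [
    Tier("headless-browser", 20, headless_browser),
    Tier("plain-http", 1, plain_http),
]

if __name__ == "__main__":
    print(fetch_with_fallback("https://example.com", TIERS))
    print(fetch_with_fallback("https://blocked.example", TIERS))
```

The point of the design is that most URLs are served by the cheapest tier, so the average cost per URL stays close to that tier's cost; the expensive tiers are only paid for on the pages that actually need them.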



