Hacker News new | past | comments | ask | show | jobs | submit | t_a_v_i_s's comments login

Nice work. I’m trying something similar with https://kadoa.com/playground


What did you try?


I tried the specialized bikes example and it said I need to use the API since it requires proxy.


There's a lot of "well I just saw the latest version of GPT do exactly what your startup does on Twitter today".


Are you writing any checks or holding off until the dust settles?


Like anything new, there will be winners and losers. The winners will find unique wedges and create incredible value. The losers will at the very least jump on the hype cycle but then quickly become redundant and obsolete.


I think Import.io, Bright Data, and Zyte would fall into that category.


These are web scraping services. Data ownership is a grey zone at best, depending on which country you're in. Besides, copyrighted data might be scraped by accident. What I am proposing is much stronger than that, rather like an "audited" dataset that comes with guarantees because its curation can be fully backtraced.


I'm guessing you can pay one of these companies to meet these types of requirements.

I know that highly-regulated financial institutions that purchase web-scraped data have very strict rules about the data they buy.

The US has an organization dedicated to this: https://www.investmentdata.org


Makes sense. Basically the FAANG strategy.


I'm working on something similar https://www.kadoa.com

The main difference is that we're focusing more on scraper generation and maintenance to scrape diverse page structures at scale.


You might want to try https://www.kadoa.com (disclaimer: I'm one of the founders)



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: