We have all seen the recent large deals made by tech companies to purchase access to various types of data for training their models (or Reddit, Photobucket). I have also seen some articles about the industry’s ever growing need for unique media and data that seem to suggest the existence of a market and brokers in need of new sources that are not online. They seem willing to pay, but I don’t see an obvious way to sell.
I believe I have access to troves that have never and will never be online. Some quick research has not turned up any obvious marketplace online or who to talk to.
Is anyone here in this business or have any advice or resources for people like me who want to explore offering training data for sale or license?
Cloudflare's new marketplace lets websites charge AI bots for scraping
https://news.ycombinator.com/item?id=41625903