I am pretty much every stereotype of a lefty American, but even I have some hesitation of using Vox media to train AIs. Vox is pretty non-apologetically partisan, which isn’t necessarily a problem but I don’t know is a good idea for training AI.
I do actually really like Vox, but I dunno, this is giving me a bad vibe.
my uneducated guess is that the partnership is because Vox has a lot of explainers written and used to embed these little factoid explainer cards in their articles which are probably useful from a data point of view
also - I think this is for some kind of RAG, not training
I'm still unclear whether these partnerships are "we're going to make a special segment of our training corpus that is text from Vox Media because we think it's extra valuable" or "we almost certainly scraped a bunch of Vox data when we built our training corpus years ago, and now we're paying them to not sue us".
https://www.threads.net/@reckless1280/post/C7juyV6xRrY Nilay from the verge sums it up well here. There is a firewall between editorial and commercial parts of the business. They'll disclose and continue to report, just like they already report on Netflix (they have a netflix show), The Verge has taken money from Comcast but antagonize them as well.
I do actually really like Vox, but I dunno, this is giving me a bad vibe.