Thx. The Springer Nature PDF is somehow STILL confusing as it addresses family ties to NY but not to Axel Springer. However, I guess my priors regarding family business were overstrong.
The entire article is about the semantics of the word "open", so an organization literally named "OpenAI" is obviously relevant on the article's own terms.
Besides which, this is a summary of a research paper that DID include OpenAI on the list. Omitting it is one of several choices that change the takeaways of the paper (linked above), which did a much better job at acknowledging and sorting the benefits of different varieties of "open"-ness.
Overall, TFA takes a cherrypicked scope on the naming controversy that hides the fact that some participants are contributing much more to public research and access than others.
Which models claim that they are "open source" but are not? I think none of them. I think people confuse "open weight" with "open source." That confusion comes from the "internet AI experts," not the labs that produced them.
Facebook has been touting their work as both "open source" and "open science", despite being in direct violation of the terms [1, 2]. Others have done similar things, but not on such a big scale and with models with such big reach. In the community, the terms have now become sufficiently muddied that it is hard to have a conversation about what an open model even is.
Thankfully, OLMo [3] reverted their decision to go with a license that contained usage restrictions and hopefully this sets the standard going forward so that the corporate actors no longer can appropriate the term and will find a suitable term on their own such as "available". But it pains me to see that they had to write "truly open" to distinguish themselves from the "poseurs".
Also from the Llama 3 announcement: 'Today, we’re introducing Meta Llama 3, the next generation of our state-of-the-art open source large language model.'
The confusion also stems from companies deliberately using the term for marketing purposes. Strong separation between the open source, academia and business environments doesn’t exist as much as it used to. Can’t imagine a Richard Stallman or Linus Thorvalds engaging in blurry half-open source, half business projects.
Aside from what others have mentioned (tons of them do use the term open source), "open weight" is deliberately meant to parallel "open source" but in practice it doesn't. Most "open weight" models have usage restrictions that would make a code license most definitely not open.
The right term for most of these models would be "weights available".
I think France (Macron) help push through the 'open' exemption, whatever that practically turns out to be, for the final EU AI Act...obviously not lost on people that is where Mistral lives.
When they say “LLM data”, does that usually include the tokenizer as well? Beginner question from someone at the end of Karpathy’s Zero to Hero course.