Honest question: is there a reason for the naming conventions for these models? Anything that makes it better than giving them names with model numbers, like “Claude 3” or such?
Each of the nouns is a “size class” in literature: from short poems (haiku, sonnet) to a longer story (fable) to a very large work (opus) to a culture-defining foundational story (myth).
It’s a fun way to hint at how many parameters are in the model without revealing a number like 405B or 17B, which isn’t really comparable across models anyway.
My non-poetic brain thinks we should call them Mini, nothing, Pro, Max, and then version numbers. Exactly like Apple. It'd be so much easier to parse. Maybe the AI companies like having the affectionate names haha
I personally hate the Mini, Nothing, Pro and Max nomenclature.
I find it very tacky, and it's confusing to have to remember that a "base model" exists without any descriptor.
Apple does make it worse though, they have products like the iPhone Pro Max, which combines two!