Are there any specifics about how this was trained, especially when 5.1 is only a month old? I'm a little skeptical of benchmarks these days and wish they'd put this up on LMArena.
edit: noticed 5.2 is ranked in the webdev arena (#2, tied with gemini-3.0-pro), but not yet in the text arena (last update 22 hrs ago)
I'm extremely skeptical, given all those articles claiming OpenAI was freaking out about Gemini. Now it turns out they just casually had a better model ready to go? I don't buy it.
Yeah, I noticed something similar with Claude: around the time of the Opus 4.5 release, Sonnet 4.5 was just dumb for a few days, though it seemed temporary. I suspect they redirected resources to Opus.
How do you know this is a better model? I wouldn't take any of the numbers at face value, especially when all they've done is more/better post-training, meaning the base pre-trained model's capabilities are unchanged. The new model may just elicit some of the benchmark capabilities better. You really need to spend time using the model to reach any reliable conclusions.