It's very doubtful that they'd have any kind of magical breakthrough that makes ...

sosodev · 2025-08-07T16:42:37 1754584957

How do you figure? They’ve hinted that the reasoning breakthrough used to achieve gold in the IMO will be here in GPT-5.

gowld · 2025-08-07T16:46:44 1754585204

What breakthrough? The self-awarded "gold" IMO result was achieved by running the model for over 1hr per question.

sosodev · 2025-08-07T16:52:24 1754585544

That sounds like a breakthrough to me. I don’t think GPT-4 could accomplish the same thing given several hours to try.

vlovich123 · 2025-08-07T16:53:57 1754585637

Said another way, 30 min less than what humans get? It’s on average 90 min per question.

mtlmtlmtlmtl · 2025-08-07T17:05:09 1754586309

And how much energy does a human being consume while spending 90 minutes on an IMO question?

jjmarr · 2025-08-07T17:16:58 1754587018

Probably more. 200 kcal (a shrinkflated bag of chips) is about 232 watt hours. A typical 4o query is 0.3 to 3 watt hours.

https://epoch.ai/gradient-updates/how-much-energy-does-chatg...
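
For anyone who wants to check the conversion above, here is a minimal sketch. It assumes the standard thermochemical definition of 1 kcal = 4184 J; the function name `kcal_to_wh` is made up for illustration, and the 0.3 to 3 Wh range is just the figure quoted in the comment.

```python
# Unit conversion sketch: food energy (kcal) to watt hours.
# Assumption: 1 kcal = 4184 J (thermochemical kilocalorie), 1 Wh = 3600 J.
JOULES_PER_KCAL = 4184
JOULES_PER_WH = 3600

def kcal_to_wh(kcal):
    """Convert kilocalories to watt hours."""
    return kcal * JOULES_PER_KCAL / JOULES_PER_WH

human_wh = kcal_to_wh(200)  # the bag-of-chips figure from the comment

# How many quoted 4o queries fit in that energy budget,
# at the high (3 Wh) and low (0.3 Wh) ends of the quoted range.
queries_at_high_cost = human_wh / 3.0
queries_at_low_cost = human_wh / 0.3

print(f"200 kcal is about {human_wh:.0f} Wh")
print(f"That covers roughly {queries_at_high_cost:.0f} to {queries_at_low_cost:.0f} queries")
```

Note this only compares against short chat queries, not an hour-long reasoning run.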

SketchySeaBeast · 2025-08-07T17:23:31 1754587411

But how much time does that 0.3 watt hour query take to run? They imply that an individual ChatGPT query takes 0.3-3 watt hours, but most queries come back in seconds, so we need to scale that over a whole hour of processing.

Edit: Scrolling down: "one second of H100-time per query, 1500 watts per H100, and a 70% factor for power utilization gets us 1050 watt-seconds of energy", which is how they get down to 0.3 = 1050/60/60.

OK, so if they run it for a full hour it's 1050*60*60 = 3.8 MW? That can't be right.

Edit Edit: Wait, no, it's just 1050 watt hours, right (though let's be honest, the 70% power utilization is a bit goofy - the power is still used)? So it's about 4.5x the human's 232 watt hours to solve the same question?
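
The back-and-forth above is easier to follow when the units are kept explicit. A rough sketch, using only the numbers quoted in this thread (1500 W per H100, 70% utilization, 232 Wh for the human). The big assumption, labeled in the code too, is a single H100 running for the whole hour; nobody in the thread knows what hardware the IMO run actually used, and `gpu_energy_wh` is a made-up helper name.

```python
# Energy = power x time. Watts x seconds gives joules; divide by 3600 for Wh.
H100_WATTS = 1500     # per-GPU figure quoted from the article
UTILIZATION = 0.70    # power-utilization factor quoted from the article
HUMAN_WH = 232        # 200 kcal figure from the earlier comment

def gpu_energy_wh(seconds, gpus=1):
    """Energy in watt hours for `gpus` GPUs running for `seconds`."""
    joules = H100_WATTS * UTILIZATION * seconds * gpus
    return joules / 3600

per_query_wh = gpu_energy_wh(1)      # one second of H100 time, ~0.3 Wh
per_hour_wh = gpu_energy_wh(3600)    # ASSUMPTION: one H100 for a full hour

# The 1050*60*60 figure is joules (watt-seconds), not watts:
per_hour_joules = per_hour_wh * 3600  # ~3.78 MJ

ratio = per_hour_wh / HUMAN_WH
print(f"per query: {per_query_wh:.2f} Wh")
print(f"per hour:  {per_hour_wh:.0f} Wh ({per_hour_joules / 1e6:.2f} MJ)")
print(f"vs human:  {ratio:.1f}x")
```

If the run used more than one GPU, pass `gpus=N` and the total scales linearly, so the single-GPU number is best read as a lower bound under these assumptions.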

andai · 2025-08-07T16:48:26 1754585306

The gold which Google won too, right?

og_kalu · 2025-08-07T17:03:44 1754586224

No, Sam explicitly said that breakthrough wouldn't be in GPT-5.