> AI already blows most people out of an IQ test at a fraction of the computational power of a brain
AFAIK, IQ tests used in psychological evaluations do not contain any randomness, so the exact answers are almost always in distribution. I haven't seen anyone compare AI to an IQ test that is not in distribution.
On ARC-AGI, which is mildly similar to a randomly generated IQ test, humans are still much better than LLMs. https://arcprize.org/ (scroll down for the chart)
Sorry, you're right that the chart on the home page does not show human performance. The leaderboard chart does: https://arcprize.org/leaderboard. By default the leaderboard shows scores for both ARC-AGI 1 and 2. The models are much worse at 2 than at 1; the best-performing model scores around 15% (Grok 4, thinking), while humans are at ~100%.
Thanks, and do we know if the humans are average people off the street, or unusually-intelligent people?
EDIT: OK, I see there are 3 types of humans:
"Avg. Mturker" does worst. "Stem Grad" and "Human Panel" are basically equivalent in terms of quality but differ in cost.
It's not obvious to me whether an average Mturker would be more or less clever than the average person. Mturk doesn't pay very well, so you'd think you'd have to be below average to want to do it. But it could also attract people of above-average intelligence who happen to live in lower-income countries, where the pay goes further.