And DeepSeek is just 3% behind. It seems in that benchmark all LLMs perform well...

rvnx · 2025-02-18T09:25:15 1739870715

It could also be that they got "inspired" by DeepSeek, hence the very similar results.

So it could be that their success is mostly about taking an open and free thing and turning it proprietary.

torginus · 2025-02-18T10:38:15 1739875095

These percentage points don't mean anything. Look up how the Elo system works. They just add 1000 to the result to make it a nicer number.

riku_iki · 2025-02-18T15:31:27 1739892687

There are LLMs below 1000 on the leaderboard.

torginus · 2025-02-19T14:30:25 1739975425

So? Percentage differences are only meaningful when the scale has a meaningful zero point, which is not the case here: Elo ratings are defined only up to an arbitrary offset.
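
To make that concrete, here is a minimal sketch, assuming the leaderboard uses the standard Elo expected-score formula (base 10, scale 400); I haven't checked its exact fitting procedure. The two ratings are made up for illustration, and `expected_score` and `percent_gap` are hypothetical helper names, not anything from the leaderboard's code.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Standard Elo expected score of A against B (base 10, scale 400)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def percent_gap(rating_a: float, rating_b: float) -> float:
    """How far B is 'behind' A, as a percentage of A's rating."""
    return 100.0 * (rating_a - rating_b) / rating_a


# Hypothetical ratings, chosen only so the gap is about 3% at this offset.
model_a, model_b = 1400.0, 1358.0

# Shift both ratings by the same constant: the "percent behind" changes,
# while the predicted head-to-head win rate stays the same.
results = []
for offset in (-1000.0, 0.0, 1000.0):
    a, b = model_a + offset, model_b + offset
    results.append((offset, percent_gap(a, b), expected_score(a, b)))

for offset, gap, win in results:
    print(f"offset {offset:+7.0f}: gap = {gap:5.2f}%  P(A beats B) = {win:.3f}")
```

Only the rating difference enters the formula, so the same 42-point gap reads as a different percentage depending on where the scale happens to be anchored.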