This may look low: an Elo of 1500 is roughly a mediocre club player. But if it is genuinely obeying the rules of the game, this is a big deal. It's a signal that if it acquires some real expertise, e.g. learning to use or construct better search algorithms (like MCTS with heuristics to evaluate a position) and to improve by itself (somewhat like AlphaZero did), then it may eventually reach superhuman level.
It might then reach superhuman level in any task simpler than chess, which would be enough to destroy many human jobs.
EDIT: From the article: "With this prompt ChatGPT almost always plays fully legal games." Relax: we're still far from that.
The median chess player is usually described as mediocre (if you ask chess players). They play about as badly as the median clarinet player in your high school band or orchestra.
I think current LLM architectures limit the strategies they can learn. MCTS requires recursion, but a GPT always executes a fixed number of steps per token. Giving language models more flexibility via a variable number of steps, for example by making the model recurrent, would remove this hard bound, but such models are harder to design and train. We have only just managed to train GPT-sized models as it is.
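To make the contrast concrete, here is a minimal sketch (my own toy example, not anything from the article) of why MCTS needs a variable step budget. It runs UCB1 over the root moves of a trivial take-1-or-2-stones game, with random rollouts; the `iters` parameter is an open-ended search budget you can crank up for a harder position, which is exactly what a fixed-depth forward pass cannot do.

```python
import math
import random

# Toy game: a pile of n stones; each turn a player removes 1 or 2;
# whoever takes the last stone wins. (Multiples of 3 are lost positions
# for the player to move.)

def moves(n):
    return [m for m in (1, 2) if m <= n]

def rollout(n, player):
    # Play the position out with uniformly random moves;
    # return the winner (0 = us, 1 = opponent).
    while True:
        n -= random.choice(moves(n))
        if n == 0:
            return player
        player ^= 1

def mcts_best_move(n, iters=2000):
    # Single-level MCTS: UCB1 bandit over root moves, random playouts.
    # `iters` is a *variable* amount of computation, unlike the fixed
    # number of transformer layers executed per token.
    stats = {m: [0, 0] for m in moves(n)}  # move -> [wins, visits]
    for t in range(1, iters + 1):
        def ucb(m):
            w, v = stats[m]
            if v == 0:
                return float('inf')  # try each move at least once
            return w / v + math.sqrt(2 * math.log(t) / v)
        m = max(stats, key=ucb)
        # After we play m, the opponent is to move (unless we just won).
        winner = 0 if n - m == 0 else rollout(n - m, 1)
        stats[m][1] += 1
        if winner == 0:
            stats[m][0] += 1
    return max(stats, key=lambda m: stats[m][1])
```

For example, from a pile of 4 the search converges on removing 1 stone (leaving the opponent on a multiple of 3), and it gets there only because the loop is free to iterate as long as its budget allows.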
I'm sure the MuZero chess policy network alone would rate much higher than 1400, and it has no notion of recursion either. (And it also wasn't taught the rules explicitly.)
Yes, I wouldn't say it's impossible, but it's a hard limit imposed by the architecture: any MCTS it ran internally would have to terminate after a few iterations. And the number of sequential steps isn't that large; the computation is just massively parallel.