https://github.com/adamkarvonen/chess_gpt_eval
I expect the rest to be much worse if 4's performance is any indication
https://github.com/adamkarvonen/chess_gpt_eval
I expect the rest to be much worse if 4's performance is any indication