I fed it yesterday's and it had an essentially perfect run, even though it wasn't certain about 1 of the words in its 3rd guess.
Today it made 1 error, and only on its first guess: it correctly identified the palindromes, but there were 5 of them and it picked the wrong one. After that it swiftly found all the categories.
I also fed the same prompt for today's to o1-mini and it just outright sucked; it even repeated an incorrect guess immediately after being told it was wrong.
There's definitely quite a leap between the mini and the full o1.