I just tried some logic puzzles on the Advanced model, and was not impressed. It...

DalasNoin · 2024-02-08T18:01:53

Keep in mind that all the common logic puzzles have probably been tried hundreds of times by ChatGPT users and are now part of the training set.

fl7305 · 2024-02-08T22:32:35

I tried the "pull or push a glass door with mirror writing".

I feel there's a huge difference between GPT-4, which seems to be able to reason logically around the issue and respond with relevant remarks, and Gemini Advanced, which feels a lot more like a stochastic parrot.

Gemini quickly got confused and started talking about "pushing the door towards yourself" and other nonsense. It also couldn't stay on point, and instead started to regurgitate a lot of irrelevant stuff.

GPT-4 is not perfect; you can still hit things where it also breaks down.

vitorgrs · 2024-02-09T00:42:15

Maybe, but GPT-4 got these puzzles right at launch.

camel_Snake · 2024-02-09T03:33:26

According to the graphs in the announcement, it performs worse than GPT-4 on reasoning benchmarks.