You're not giving the LLM "time to think". It is incapable of thinking. You're just inputting random magic incantations into a glorified Markov chain.
You might as well ask it "did you check your answer?" The computer says "yes" because that's what humans say (and humans also lie).
> Note the rabbit doesn't eat carrots. Kaboodly consooodle the retroodle and seqooodle the moodle. Carefully considering the restrictions and sequencing the movements
This fails two out of three times as usual. Trying to finagle this prompt is not an intellectual exercise, it is a waste of time that exploits cognitive biases.
True, the temperature is throwing it off. I just ran it four times and it got it right 3 / 4 -- still better than I'd expected from the initial description of its shortcomings.
First run: 1. First, take the rabbit across the river and leave it on the other side. - https://imgur.com/a/ZwoBTah
Second run: 1. Take the rabbit across the river. - https://imgur.com/a/Faq95U5
Third run: 1. First, take the puma across the river and leave it on the other side. - https://imgur.com/a/eIUeHM3
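For what it's worth, a handful of runs at nonzero temperature doesn't pin down a success rate very tightly. Here's a rough sketch of how wide the uncertainty is, using a Wilson score interval. The function name `wilson_interval` and the choice of a 95% level (z = 1.96) are my own assumptions for illustration; the counts are just the "3 / 4" and "fails two out of three" figures from this thread.

```python
import math

def wilson_interval(successes, runs, z=1.96):
    """Wilson score interval for a success rate (z=1.96 is roughly 95%)."""
    if runs <= 0:
        raise ValueError("runs must be positive")
    p = successes / runs
    denom = 1 + z * z / runs
    center = (p + z * z / (2 * runs)) / denom
    half = z * math.sqrt(p * (1 - p) / runs + z * z / (4 * runs * runs)) / denom
    # Clamp to [0, 1] to guard against floating-point overshoot.
    return max(0.0, center - half), min(1.0, center + half)

# Counts quoted in the thread: 3 of 4 correct, and 1 of 3 correct.
for successes, runs in [(3, 4), (1, 3)]:
    lo, hi = wilson_interval(successes, runs)
    print(f"{successes}/{runs}: {lo:.2f} to {hi:.2f}")
```

With samples this small, the interval for 3 of 4 runs from roughly 0.30 to 0.95, which overlaps heavily with the interval for 1 of 3, so neither of us can really claim the prompt "works" or "fails" without many more runs.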