You can fix the people-pleasing mode thing by simply adding the words "be critic...

vessenes · 2025-03-01T04:53:34 1740804814

Hmm. I’m on the other side of this - this feels like what I imagined a scaled up gpt 4 would be: more nuanced and thoughtful. It did the best of any model at my “write an essay as if Hemingway went along with rfk jr when he left the bear in Central Park.” Actual prompt longer. This is a hard task because Hemingway’s prose is extremely multilayered, and his perspective and physical engagement are notable as well.

I’d say 4.5 is by far the best at this of released models. It’s probably the only one that thought through both what skepticism and connection Hemingway might have had along for that day and the combination of alienation posing and privilege rfk had. I just retried deepseek on it: the language is good to very good. Theory of mind not as much.

Edit: grok 3 is also pretty good. Maybe a bit too wordy still, and maybe a little less insightful.

phillipharris · 2025-03-01T08:45:46 1740818746

What was your actual prompt? I just asked it for that Hemingway story and the result didn't impress me -- it had none of the social nuance you mentioned.

farts_mckensy · 2025-03-01T01:51:02 1740793862

No, GPT 4 cannot consistently hold a position for longer than a few minutes in my experience.