Point being? Unlike Catholic saints, ChatGPT models actually exhibit these prope...

andrepd · 2025-07-15T13:47:47 1752587267

My grandma also uses saints for actual tasks (e.g. St Anthony for finding lost items), and they exibith those properties in observable ways (e.g. he found her sewing needles just last month). Perhaps the comparison is more appropriate than you realise.

> actually exhibit these properties in directly observable and measurable way

Well but do they? I don't mean your vibes, and I also don't mean cooked-up benchmarks. For example: https://metr.org/blog/2025-07-10-early-2025-ai-experienced-o...

thedevilslawyer · 2025-07-15T23:22:48 1752621768

If not users opinions, or objective benchmarks, then what? Sounds like you prefer closing your ears saying 'nananana...'

TeMPOraL · 2025-07-15T18:42:10 1752604930

> Perhaps the comparison is more appropriate than you realise.

Or perhaps you stop being obtuse. There's no causal connection between "using saints for actual tasks" and the outcomes, which is why we call this religion. In contrast, you can see the cause-and-effect relationship directly and immediately with LLMs - all it takes is going to chatgpt.com or claude.ai, typing in a query, and observing the result.

> Well but do they? I don't mean your vibes, and I also don't mean cooked-up benchmarks.

Do read the study itself, specifically the parts where the authors spell out specifically what is or isn't being measured here.

andrepd · 2025-07-15T21:15:10 1752614110

It's really simple x) either the "observation" is just vibes, and then it's fundamentally the same as when Gran's knees get better after she asks Saint Euphemia, or it's actually a scientific observation, in which case please post! :)

You may not like but it's what it is.

TeMPOraL · 2025-07-16T08:03:22 1752653002

I want to scream Jesus F Christ, but I guess that would be making your point for you :).

Still, it's not what you're saying. Vibes are present, too, but so are concrete effects like one-off utilities conjured in 2 minutes of prompting instead of 10 minutes of searching or 1 hour of coding, or research questions answered swiftly (and answers verified by me), that I normally wouldn't consider tackling due to up-front effort being greater than my discretionary time. And so on.

This is not vibes, but hard, empirical evidence, immediately retrievable as it just accumulates in the chat history, so I can always go back to check if I wasn't misremembering things.