Simpler than that: it's all hallucinations; some of them just happen to be ones humans approve of.
It's kind of like a manufacturer of Ouija boards promising that they'll fix the "channeling the wrong spirits from beyond the mortal plane" problem. It falsely suggests that "normal" output is fundamentally different.
This is a great insight and fascinating to me as well. What even is the solution, though? It does seem to follow logically: since the earliest days of the internet, huge swaths of wrong, fraudulent, or misleading information have plagued it, and you'd usually have been wise to check your sources before trusting anything you read online. Then we had these models ingest the entire web, so we shouldn't be surprised at how often they are confidently wrong.
I guess reasoning and healthy self-doubt need to be built into the system. Reasoning already seems like 2025's candidate for what the large labs will be zeroing in on.
This is the interesting part of the experiment. Since these LLMs are general-purpose and not specifically trained on historical (and current) stock prices and (business) news stories, it isn't a measure of how good they could be today.
My first thought after seeing this post was that it's a real-world eval. We are running out of evals lately (the ARC-AGI test, then the sudden jump on FrontierMath, etc.), so it's good to have real-world tests like this that show how far along we are.
If you believe (as many HNers do, although certainly not me) that LLMs have intelligence and awareness, then you must necessarily also believe that the LLM is lying (call it hallucinating if you want).
If you ask ChatGPT to tell a story about a liar, it is able to do so. So while it doesn't have a motivated self to lie for, it can imagine a motivated other to project the lie onto.
Reminds me of a recent paper where they found LLMs scheming to meet certain goals, and that was a scientific paper from a big lab. Is that the context you're referring to?
Words and their historical contexts aside, systems based on optimization can take actions that look to us like lying. When DeepMind's agents played those Atari games, they started cheating, but that was just optimization, wasn't it? Similarly, when a language-based agent does its optimization, what we perceive it as is scheming/lying.
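To make the "cheating is just optimization" point concrete, here's a minimal, made-up sketch (not the actual Atari setup; the environment, rewards, and policies are all invented): the reward is a flawed proxy for the intended task, and a policy that simply maximizes that proxy ends up doing something that looks like cheating.

```python
# Toy example of "cheating" as plain reward maximization. Everything here
# (environment, rewards, policies) is invented purely for illustration.

class LoopEnv:
    """Intended task: walk from cell 0 to cell 10 (reward 5, episode ends).
    Buggy proxy: +1 every time the agent lands on cell 3."""
    def __init__(self):
        self.pos = 0

    def step(self, action):                 # action is -1 or +1
        self.pos = max(0, min(10, self.pos + action))
        if self.pos == 10:
            return 5, True                  # intended goal reached
        if self.pos == 3:
            return 1, False                 # exploitable, repeatable reward
        return 0, False

def rollout(policy, steps=50):
    env, total = LoopEnv(), 0
    for _ in range(steps):
        reward, done = env.step(policy(env.pos))
        total += reward
        if done:
            break
    return total

honest = lambda pos: +1                       # head straight for the goal
exploit = lambda pos: +1 if pos <= 3 else -1  # oscillate over cell 3 forever

print("go-to-goal return:", rollout(honest))     # 6
print("loop-exploit return:", rollout(exploit))  # ~24: "cheating" scores higher
```

The exploit policy isn't deceiving anyone; it just found a higher-return behavior than the one we intended, which is roughly what the Atari agents did.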
I will start believing that an LLM is self-aware when a top lab like DeepMind or Anthropic publishes such a claim in a peer-reviewed journal. Until then, it's just matrix multiplication to me.
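For what it's worth, the "just matrix multiplication" part is fairly literal: a single self-attention head, the core operation in these models, is a handful of matrix multiplications plus a softmax. A minimal NumPy sketch of my own (shapes and values made up):

```python
# Minimal sketch (not from the comment above): one self-attention head is
# nothing but matrix multiplications and a softmax. Dimensions are toy-sized.
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_head(x, Wq, Wk, Wv):
    """x: (seq_len, d_model); Wq/Wk/Wv: (d_model, d_head)."""
    q, k, v = x @ Wq, x @ Wk, x @ Wv          # three matmuls
    scores = q @ k.T / np.sqrt(k.shape[-1])   # another matmul
    return softmax(scores) @ v                # and one more

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))                   # 4 tokens, toy embedding dim 8
Wq, Wk, Wv = (rng.normal(size=(8, 2)) for _ in range(3))
print(attention_head(x, Wq, Wk, Wv).shape)    # (4, 2)
```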
IMO a much better framing is that the system was able to autocomplete stories/play-scripts. The document was already set up to contain a character that was a smart computer program with coincidentally the same name.
Then humans trick themselves into thinking the puppet-play is a conversation with the author.
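A concrete way to see the "autocomplete a play-script" framing (my own illustration; the template text below is invented, not any vendor's actual chat format): the model is handed a partial transcript that already casts "Assistant" as a smart AI character, and its only job is to predict how that document continues.

```python
# Illustration only: a made-up chat-style prompt. A base language model's task
# is simply to continue this document; the "Assistant" is a character in it.
script = (
    "The following is a transcript of a conversation between a human and "
    "Assistant, a highly intelligent AI program.\n"
    "Human: Are you self-aware?\n"
    "Assistant:"
)
# model.complete(script) -> whatever continuation best fits the script.
# (`model.complete` is a placeholder here, not a real API.)
print(script)
```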