A couple of weeks back I rewrote a 4-page document from first person to third person. After editing, I gave the document to Claude Sonnet 4, so at that point it was entirely in the third person, and asked it to review it and highlight any places that were still in the first person.
>Looking through the document, I can identify several instances where it's written in the first person:
And it went on to list a series of "they/them" statements. I asked it to clarify whether "they" is "first person", and it responded:
>No, "they" is not first person - it's third person. I made an error in my analysis. First person would be: I, we, me, us, our, my. Second person would be: you, your. Third person would be: he, she, it, they, them, their. Looking back at the document more carefully, it appears to be written entirely in third person.
Even the good models are still failing at real-world use cases which should be right in their wheelhouse.
That doesn't quite fit the definition I use for "hallucination": it's clearly a dumb error, but the model didn't confidently state something untrue about the world (like naming the wrong team as the winner of the Super Bowl).
>"They claim impressive reductions in hallucinations. In my own usage I’ve not spotted a single hallucination yet, but that’s been true for me for Claude 4 and o3 recently as well—hallucination is so much less of a problem with this year’s models."
Could you give an estimate of how many "dumb errors" you've encountered, as opposed to hallucinations? I think many of your readers might read "hallucination" and assume you mean "hallucinations and dumb errors".
I mention one dumb error in my post itself - the table sorting mistake.
I haven't been keeping a formal count of them, but dumb errors from LLMs remain pretty common. I spot them and either correct them myself or nudge the LLM to do it, if that's feasible. I see that as a regular part of working with these systems.
That makes sense, and I think your definition of hallucinations is a technically correct one. Going forward, I think your readers might appreciate you tracking "dumb errors" alongside (but separate from) hallucinations. They're a regular part of working with these systems, but they impose some cognitive load on the user, so it's useful to know whether that load will rise, fall, or stay consistent with a new model release.
As a user, when the model tells me things that are flat out wrong, it doesn't really matter whether it would be categorized as a hallucination or a dumb error. From my perspective, those mean the same thing.
I think it qualifies as a hallucination. What's your definition? I'm a researcher too, and as far as I'm aware the definition has always been pretty broad and applied to many kinds of mistakes. (It was always muddy, but it definitely got muddier when the term was adopted by NLP.)
It's hard to know why it made the error, but isn't it caused by inaccurate "world" modeling? ("World" here being the English language.) Is it not making some hallucination about the English language while interpreting the prompt or document?
I'm having a hard time thinking of any context where "they" would even be first person. I can't find any search results for one, though Google's AI says it can. It provided two links: the first was a Quora result saying people don't do this, framed as not impossible, just unheard of; the second just talks about singular you. I'd consider both of these hallucinations too, since the answer isn't supported by the links.
My personal definition of hallucination (which I thought was widespread) is when a model states a fact about the world that is entirely made up - "the James Webb telescope took the first photograph of an exoplanet" for example.
I just got pointed to this new paper: https://arxiv.org/abs/2508.01781 - "A comprehensive taxonomy of hallucinations in Large Language Models" - which has a definition in its introduction that matches my mental model:
"This phenomenon describes the generation of content that, while often plausible and coherent, is factually incorrect, inconsistent, or entirely fabricated."
The paper then follows up with a formal definition:
"inconsistency between a computable LLM, denoted as h, and a computable ground truth function, f"
Google (the company, not the search engine) says[0]
| AI hallucinations are incorrect or misleading results that AI models generate.
It goes on to give examples, and I think this is clearly a false positive result.
> this new paper
I think the error would have no problem fitting under "Contextual inconsistencies" (4.2), "Instruction inconsistencies/deviation" (4.3), or "Logical inconsistencies" (4.4), and it arguably fits other categories defined in section 4 as well. I think the paper supports a pretty broad definition.
> then follows up with a formal definition
Is this not a computable ground truth?
| an LLM h is considered to be "hallucinating" with respect to a ground truth function f if, across all training stages i (meaning, after being trained on any finite number of samples), there exists at least one input string s for which the LLM's output h_i(s) does not match the correct output f(s) [100]. This condition is formally expressed as ∀i ∈ ℕ, ∃s ∈ S such that h_i(s) ≠ f(s).
I think yes, and this is an example of such an "i". I would go so far as claiming that this is a pretty broad definition: it just says the model counts as hallucinating if it makes something up about material it was trained on (as opposed to something it wasn't trained on). I'm pretty confident the LLMs ingested a lot of English grammar books, so I think it's fair to say this was in the training data.
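To make that concrete, here is one way to instantiate the quoted symbols for this case - the assignments of s, f, and h_i below are my own illustration, not the paper's:

```latex
% My own instantiation of the quoted condition, not an example from the paper:
%   s   = the review prompt together with the already-edited document
%   f   = the grammatical ground truth ("no remaining first-person passages")
%   h_i = the deployed model at its current training stage
\[
  h_i(s) = \text{``several first-person instances (they/them)''}
  \neq
  \text{``none''} = f(s)
  \;\Longrightarrow\;
  \exists\, s \in S : h_i(s) \neq f(s)
\]
```

Under that reading, the first-person error is a witness for the existential part of the condition.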
How is "this sentence is in first person" when the sentence is actually in third person not a hallucination? In a question with a binary answer, this is literally as wrong as it could possibly get. You must be doing a lot of mental gymnastics.
So 85% of people agreed with my preferred "fabricated facts" one (that's the best I could fit into the Twitter poll option character limit) but that means 15% had another definition in mind.
And sure, you could argue that "this sentence is in first person" also qualifies as a "fabricated fact" here.
I'm now running a follow-up poll on whether or not "there are 3 Bs in blueberry" should count as a hallucination and the early numbers are much closer - currently 41% say it is, 59% say it isn't. https://twitter.com/simonw/status/1953777495309746363
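For reference, the ground truth behind that poll question is trivial to check; a minimal Python sketch (the snippet is mine, not from the poll):

```python
# Establish the ground truth behind the poll question: how many times
# does the letter "b" appear in "blueberry"?
word = "blueberry"
b_count = word.count("b")
print(f"'{word}' contains {b_count} b's")  # prints 2, not 3
```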
So? That doesn't change the fact that it fits the formal definition. Just because LLM companies have fooled a bunch of people into thinking they are different doesn't make it true.
If they were different things (objectively, not "in my opinion these things are different"), then they'd be handled differently. Internally they are the exact same thing - wrong statistics - and they are "solved" the same way: more training and more data.
Edit: even the "fabricated fact" definition is subjective. To me, the model saying "this is in first person" is it confidently presenting a wrong thing as fact.
What I've learned from the Twitter polls is to avoid the word "hallucination" entirely, because it turns out there are enough people out there with differing definitions that it's not a useful shorthand for clear communication.
This just seems like goalpost shifting to make it sound like these models are more capable than they are. Oh, it didn't "hallucinate" (a term which I think sucks because it anthropomorphizes the model), it just "fabricated a fact" or "made an error".
It doesn't matter what you call it, the output was wrong. And it's not like something new and different is going on here vs whatever your definition of a hallucination is: in both cases the model predicted the wrong sequence of tokens in response to the prompt.
Since I mostly use it for code, made-up function names are the most common. And of course just broken code altogether, which might not count as a hallucination.
I think the type of AI coding being used also has an effect on a person's perception of the prevalence of "hallucinations" vs other errors.
I usually use an agentic workflow, and "hallucination" isn't the first word that comes to mind when a model unloads a pile of error-ridden code slop for me to review, despite it being entirely possible that hallucinating a non-existent parameter is what originally made it go off the rails and begin the classic loop of breaking things more with each attempt to fix them.
Whereas with AI autocomplete/suggestions, an invented method name or argument (or whatever else) clearly jumps out as a "hallucination" if you're familiar with what you're working on.
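As a sketch of why the autocomplete case is so easy to spot (the example is mine, not from the comment above): an invented call fails loudly and immediately, whereas the same kind of error buried in a pile of agent-generated code takes real review effort to trace back.

```python
# A classic single-line "hallucination": a method borrowed from another
# language (JavaScript's Array.push) that Python lists don't have.
items = [1, 2, 3]

try:
    items.push(4)          # invented method -> AttributeError at runtime
except AttributeError as err:
    print(f"hallucinated call: {err}")

items.append(4)            # the call that actually exists
print(items)               # [1, 2, 3, 4]
```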