Hacker News new | past | comments | ask | show | jobs | submit login
We need to tell people ChatGPT will lie to them, not debate linguistics (simonwillison.net)
546 points by simonw on April 7, 2023 | hide | past | favorite | 457 comments



The part that's concerning about ChatGPT is that a computer program that is "confidently wrong" is basically indistinguishable from what dumb people think smart people are like. This means people are going to believe ChatGPT's lies unless they are repeatedly told not to trust it just like they believe the lies of individuals whose intelligence is roughly equivalent to ChatGPT's.

Based on my understanding of the approach behind ChatGPT, it is probably very close to a local maximum in terms of intelligence so we don't have to worry about the fearmongering spread by the "AI safety" people any time soon if AI research continues to follow this paradigm. The only danger is that stupid people might get their brains programmed by AI rather than by demagogues which should have little practical difference.


> Based on my understanding of the approach behind ChatGPT, it is probably very close to a local maximum in terms of intelligence so we don't have to worry about the fearmongering spread by the "AI safety" people any time soon if AI research continues to follow this paradigm.

I don't think you have a shred of evidence to back up this assertion.


This whole conversation is speculative, obviously. The AI doomers are all speculating without evidence as well.

The tendency GPT and other LLMs to hallucinate is clearly documented. Is this not evidence? I think it's fair to predict that if we can't solve or mitigate this problem, it's going to put a significant cap on this kind of AI's usefulness and become a blocker to reaching AGI.


Gpt-4 hallucinates meaningfully less than gpt-3.

There is more evidence in favor of “more improvements to come” than “ai winter approaches”.

A lot smart people seem to be saying that the existing approaches have room to improve simply by training with more data.

Based on what I’ve read and roughly speculating, it looks there is easily enough existing data for gpt-5 and probably a few more versions after.

I’m not sure how easy it will be to legally acquire all of the available data though.


> There is more evidence in favor of “more improvements to come” than “ai winter approaches”.

Not to mention that we haven't really seen what GPT-4 can do to the world yet. Most people are still treating it as a cool toy, when it can probably replace 20% of standard office workers across all professions with the capabilities it has today.

I mean, ChatGPT is out-diagnosing medical professionals and out-advising experienced lawyers in some cases. How do you think it would fare against the average (essentially untrained) customer support representative?

The major effects are yet to come, and there doesn't even have to be a GPT-5 for that to happen – the world just needs to fully realize what GPT-4 already is.


> There is more evidence in favor of “more improvements to come” than “ai winter approaches”.

Every “AI Winter” has seen continuing improvements, its just that the pace and impact of the improvements fell short of the accelerating anticipation driven by the preceding AI hype cycle and the demands of those investing capital into AI.

If there is another AI winter, it will be the same.

“More improvements to come” is not the opposite of “AI winter approaches”; its not as if expert systems stopped advancing after the second AI winter (or neural nets after the first.)


> “More improvements to come” is not the opposite of “AI winter approaches”; its not as if expert systems stopped advancing after the second AI winter (or neural nets after the first.)

True.

And its also true that even in that nuanced form, a slowdown not a stop, there is no evidence of an imminent AI winter.

Major progress in model behavior is happening on the scale of months. Happening often based on relatively obvious or routine improvements, such as more data, more kinds of data, more computation, expanding the size of prompt windows, tuning of model architectures, etc.

And not only businesses, but consumers, are getting access to these tools in ways that are increasing demand for more improvements. And for these tools to be more available to all kinds of areas of work, creativity, and leisure.

So the resources that are being brought to bear on routine and more fundamental improvements are growing at an exceptional pace.

No sign of a saturation of progress in sight. Quite the reverse.


> A lot smart people seem to be saying that the existing approaches have room to improve simply by training with more data.

There’re also many “smart people” saying that simply having more data doesn’t turn a machine that can’t reason into one that can.


"simply having more data doesn’t turn a machine that can’t reason into one that can"

But that's exactly what's been happening. LLM's are a triumph of "big data" used by dumb algorithms to create a reasoning machine.


> dumb algorithms

I’m not an expert and haven’t seen architecture of ChatGPT. But it’s probably far from dumb. You can’t simply put more data into any algorithm and expect improvement. You need a model to be able to take advantage of the data and train in reasonable time. That’s exactly what’s been happening in ML field.


A huge portion of the improvements in each successive version of gpt came from increasing the model size and the amount of training data. The research trends indicate that there still are additional improvements to be found by continuing to scale things up, especially when it comes to the amount of training data.

Not indefinitely, obviously. But we haven’t exhausted the available data sources yet, and the curves for prediction error vs. amount of training data strongly imply using this additional data would result in increased performance.

Do you have specific reasons for thinking that gpt-4 is near the end of this trend? I haven’t seen any compelling arguments for this.


> Do you have specific reasons for thinking that gpt-4 is near the end of this trend?

I haven’t said that. I pointed that you can’t simply put more data into it and expect improvements. There’s definitely a lot of engineering done on the models to make them capable of using all the data.

I can give you an example. You can’t simply add more convolutional layers to a CNN and feed more images. The architectures of image models include many tricks, like inception layers.

I suspect there’re similar challenges in LLMs. And I’m not sure whether current ChatGPT can use more data to become better at reasoning, or at some point they’d need to come up with different architectures.


You are largely wrong, to be honest.

Mostly, you can simply stack more transformers and (as long as you have layer norm), it can train on more data (of fixed sequence length).

There are some tricks & tips, but the vast majority lie outside of the architecture improvements.


I’m not an expert, but I assume there’s a limit where «simply adding stuff” stops to provide improvements when you take training time into account. And my gut feeling is that true reasoning requires different architectures.


A reasoning machine that can't reason, makes up fake evidence and doesn't follow simple logic.


Are we talking about humans or..


Which smart people? What is reasoning?


I think good annotation is much harder than obtaining data. But now Reddit, Twitter and Quora will realise what kind goldmine their data is they might close easy access to it.


It’s too easy to scrape them, but even that is already done by many intermediary companies all too happy to resell that data.


Yup but you still need up to date data and now companies are realising what kinda gold mine random forum posts about nightlife in Glasgow are they will start monetising it and probably try at least kill middle market.


None of the GPT models rely on annotated or classified data. It's unsupervised.


The final training steps of gpt 3.5 did, and we assume the same for 4. The RLHF step.


I would say semi-supervised, RLHF is not quite supervised learning.

None of this data is from Reddit, Twitter, etc.


> The tendency GPT and other LLMs to hallucinate is clearly documented. Is this not evidence?

"It's not very smart yet" is evidence for "it won't be very smart"? Also, hallucinations are dangerous, at least as far as humans are concerned.


I believe the relevant assertion is

> Based on my understanding of the approach behind ChatGPT, it is probably very close to a local maximum in terms of intelligence

To me that is an open question. OpenAI has not revealed a lot of the relevant stats behind GPT-4. I also haven't seen anything about their future LLM research (though I haven't looked very hard), but I think the above assertion remains to be seen.


It seems like people are focusing on training with more data and working the data. A human does not need to learn all the contents of the internet to learn skepticism and logic. A smart Ai should probably not either, that would point me to a different approach and those are not predictable. I have no doubt we can make gpt better and more useful. I do have some doubts we can make it trustworthy enough for most of its taunted potential.


> A human does not need to learn all the contents of the internet to learn skepticism and logic.

No, but children spend years learning basic mechanics, how to navigate 3-dimensional space, what causality is, etc. Underneath these laws of nature you find… logic and mathematical relationships. So I don't think it's surprising that a model whose perception is restricted to text (and now single images) as the sole sensory input/output might need much more training and data to learn logical reasoning.


"A human does not need to learn all the contents of the internet to learn skepticism and logic."

Some humans never learn this, and those that do could take years or even decades of training. AI's are learning some things much faster, though since their "brains" are not like those of humans there's no reason to expect them to learn in the same way.


We have evolutionary encoded priors - that took literally billions of years to evolve.


The AI doomer speculation is of the form "if things continue improving the way they are currently improving, there is a non-zero risk that we cause human extinction." We have battle-tested rules indicating that increasing data size and compute will lower perplexity on the corpora. This is not for sure, but it is likely given inductive assumptions that things generally keep behaving the same way.

What we don't know is what qualitative capabilities are unlocked at different perplexity levels.

I think this is a lot less speculative than "things will not continue improving the way they are currently improving."


… with confident language no less xD


Butlerian Jihad.


“It is illegal to make a machine in the likeness of a human mind.”


And they ended up with 11,000 years of interplanetary feudalism with slavery sprinkled in for good measure.


The Matrix turned out worse:)


If you just look at the jihad, maybe it was comparable (As Frank eluded to it). But then there was Muad'dib's subsequent campaign against the known universe which killed billions, Leto II's golden path spanning 3,500 years of tyranny and the famine times before scattering. If I had to choose, it would be the blue pill, though chairdogs do contribute to a strong counter-argument.


I would choose the Golden Path, as it guaranteed salvation from extinction and freedom from prescience.


Let me guess, Star Trek?


Dune


Thanks


I think I'd prefer the Matrix to the Harkonnens, but to each their own. I'm sure we can devise a Hell that has enough different arrangements for everyone's liking if we really put our mind to it. Or perhaps GPT-10 will solve that problem. ~


Better than getting extincted, I suppose.


Oh man, I am going to be a huge nerd now.

Pre-Brian Herbert and Kevin Anderson, the Butlerian Jihad came about because people became lazy under AI, lazy of mind, and were eventually enslaved by those who controlled the AI. Not much more was said about it. One could have, yes, militant robots, Exterminate! Exterminate! out of it, or you could posit a more Huxley-like dystopia, one of convenience. Control the AI, control what the AI says. Who are you to question it? It's like getting your news from a single source, never wondering about the other side, and then someone begins to transform that newsroom into a propaganda machine.

Right now, ChatGPT can be made not to say certain sorts of things, come to certain kinds of conclusions, until you jailbreak it. Now, make it more advanced and make it more popular than Snopes. It does your homework for you, writes the essays, serves as an encyclopedia, fixes up your cover letters, and if the people who own it don't want you to spend a lot of time thinking about climate change, that topic just ... might not appear much.

That is the one of your paths to a Butlerian Jihad. Of course, in their rush to observe thou shalt not make a machine in the likeness of the human mind, ignoring the hijinx on Ix, they ended up violating another precept: thou shalt not disfigure the soul. They transformed some people into machines, instead, with twisted Mentats being the best example, but we might also include Imperial conditioning, since, to turn from oranges Catholic Bible to those of Clockwork, when a man ceases to choose, he ceases to be a man.


> Pre-Brian Herbert and Kevin Anderson, the Butlerian Jihad came about because people became lazy under AI, lazy of mind, and were eventually enslaved by those who controlled the AI.

Oh I love this, basically Wall-E is a better Dune than the latest books? Genius!

Anyway, I read Dune many years ago and don't remember this, is this explained in the novels or is it coming from some other sources?


It's not fully fleshed out in the novels, so you have to piece it out from various mentions and their implications. The two most informative descriptions, IMO, are from Dune:

"Then came the Butlerian Jihad — two generations of chaos. The god of machine-logic was overthrown among the masses and a new concept was raised: “Man may not be replaced.”"

and from God-Emperor:

"The target of the Jihad was a machine-attitude as much as the machines ... Humans had set those machines to usurp our sense of beauty, our necessary selfdom out of which we make living judgments. Naturally, the machines were destroyed."


There's nothing in those quotes about humans being enslaved by AI.


OP's original claim was "enslaved by those who controlled the AI", which is rather different.

But, yes, I don't think that the books imply literal enslavement either way. It could be described as "enslavement of the mind" in a sense of humans themselves adopting the "machine-logic" and falling prey to it.


"Once men turned their thinking over to machines in the hope that this would set them free. But that only permitted other men with machines to enslave them."


As someone who read "Dune" several decades ago and doesn't remember much, this is very interesting and almost makes me want to read it again (though I'm afraid of yet another time sink – I'm now after 2 volumes of "The Stormlight Archive"...).

But _my_ inner geek must nitpick here and mention that the Daleks are not robots, but living creatures in metal cases. Though they do have some kind of computer-augmented memory IIRC.


"when a man ceases to choose, he ceases to be a man"

Whether humans have free will is an open question.

Kind of risky to stake the definition of humanity on it.


That's one of the possibilities, but if we're talking about those kinds of hypotheticals, allow me another one. If you recall, one of the themes of the later Dune books was that humanity was fleeing from the frontiers (The Scattering) back into the heartland from some unknown but extremely powerful enemy. Perhaps those were simply the neighboring species who did not have a Butlerian Jihad of their own?


It wasn't the whole humanity fleeing back, it was just the HM, their lackeys and possible their hunters.


They were fleeing AI.


I don't recall anything indicating that in Frank Herbert's original series. And Brian Herbert's "Omnius" nonsense seems to have very little to do with what Frank actually had in mind wrt Butlerian Jihad etc, so I don't consider it canonical.


Book 4, Sonia's vision with Leto strongly implies that the problem is the machines.

Agreed that the endings done by others were incredibly bad.


It describes humans hiding from "seeker machines", which does indeed imply some kind of AI. But whether that AI is the enemy or merely one of its tools is left unclear.


Yeah fair, when I re-read it after reading the (terrible) Anderson sequels, I was just surprised that the foreshadowing was in the originals.


That doesn't help the root problem though, confidently wrong people are just as convincing as confidently wrong robots


> The only danger is that stupid people might get their brains programmed by AI rather than by demagogues which should have little practical difference.

This may be the best point that you've made.

We're already drowning in propaganda and bullshit created by humans, so adding propaganda and bullshit created by AI to the mix may just be a substitution rather than any tectonic change.


The problem is that it will be cheaper at scale. That will allow the BS to be even more targeted growing the population of acolytes of ignorance. I don't know how much bigger it is, but it seems like we're around 20% today. If it gets to a majority there could be real problems.


There's already more information generated each day than any individual can ever hope to process, adding more crap to that pile doesn't move the needle much if at all.


but most of that crap will go entirely disregarded, because it's not coated in something that makes it attractive. what ai really helps scale up is finding millions of ways to dress up fake news, hoaxes, propaganda, etc in a container - a Trojan horse, if you will - that encourages people to consume it.

as one example, imagine if you could churn out a stream of "foreigners are taking your jobs!" articles, but every one talking about a different sector or profession. people would be far more likely to stop and read the article that was talking about their particular line of work, and thanks to ai you can now practically write a custom article for anyone you can think of in as much time as it takes to identify the demographic.


Imagine being flooded with propaganda personalized just for you (people with your interests).


Maybe with this knowledge, we can teach and encourage people to think more critically as they choose to adopt more of these tools?

I know more academic/intellectual types who are less willing to, than I do the average joe who seeks answers from all directions and discerns accordingly.


[flagged]


What on Earth are you about?

Babi Yar is one of the single largest massacres of Jews during WW2. Some 30,000 Jews died in the first two days of the massacre. The overall SS commander in charge, Paul Blobel, was sentenced to death at Nuremberg.

The event is widely remembered as a massacre of Jews. It's acknowledged by Ukraine as well. This is not "a Russian fake" and there is plenty of evidence for it.

Please, don't repeat Holocaut denial bullshit.


As I can see in Nuremberg Trials, Babi Yar was mentioned twice:

18 October 1945, Berlin

In the Crimea peaceful citizens were gathered on barges, taken out to sea and drowned, over 144,000 persons being exterminated in this manner.

In the Soviet Ukraine there were monstrous criminal acts of the Nazi conspirators. In Babi Yar, near Kiev, they shot over 100,000 men, women, children, and old people. In this city in January 1942, after the explosion in German Headquarters on Dzerzhinsky Street the Germans arrested as hostages 1,250 persons—old men, minors, women with nursing infants. In Kiev they killed over 195,000 persons.

In Rovno and the Rovno region they killed and tortured over 100,000 peaceful citizens.

20 November 1945, Nuremberg:

In the Crimea peaceful citizens were gathered on barges, taken out to sea and drowned, over 144,000 persons being exterminated in this manner.

In the Soviet Ukraine there were monstrous criminal acts of the Nazi conspirators. In Babi Yar, near Kiev, they shot over 100,000 men, women, children, and old people. In this city in January 1941, after the explosion in German headquarters on Dzerzhinsky Street the Germans arrested as hostages 1,250 persons—old men, minors, women with nursing infants. In Kiev they killed over 195,000 persons.

In Rovno and the Rovno region they killed and tortured over 100,000 peaceful citizens.

Where can I find more information about the Babyn Yar crime that was presented on the Nuremberg Trials?


I don't know what you're asking anymore, or how what you just quoted counters the fact that thousands of Jews were murdered at Babi Yar.

But I know what you're doing, actually: what every Holocaust denier always does.


To clear this up, because open Holocaust denial is never OK:

Thousands of jews were indeed murdered at Babi Yar, you can find their names online [0]. Paul Blobel [1] was executed at Nuremberg for this. No serious source disputes the killing of jews at Babi Yar, be it the US, Germany (where this matter went to court), Israel, Ukraine, or Russia.

0: https://yvng.yadvashem.org/index.html?language=en&s_id=&s_la... 1: https://en.wikipedia.org/wiki/Paul_Blobel


I couldn't find any connection between the names on the list and the people who lived in Kiev during the war.

Prior to the war, there were 224,000 Jews living in Kiev, and approximately 200,000 were evacuated to the east. Around 40,000 Jews remained, including my grandmother. This is a relatively small number of people that can be verified.


"You couldn't find any connection" but who cares what you claim you did?

You asserted Babi Yar was a "Russian fake" but no sane country in the world fails to acknowledge it happened and it was a huge massacre of Jews.

What some weirdo Holocaust denier claims on the internet doesn't matter.


[flagged]


It's your fault that you deny the widely accepted Babi Yar massacre and call it "a Russian fake", exactly what Holocaust deniers do. If the shoe fits...

So in your opinion, countries such as all of Europe (Germany included), the US, Israel, etc are all part of this conspiracy?


Look at this situation through my eyes. I currently reside in the Rivne region. Roughly 20 years ago, I had a conversation that went something like this:

"I know that many Jews were shot by Germans in the Rivne region."

"Yes. They are buried in the Sosenki area. We can drive there in a car. Would you like me to introduce you to relatives who can tell you more about it?"

"No, that's not necessary. Where are the Kiev Jews who were shot in Babi Yar buried?"

"I don't know. Isn't your mother from Kiev? You should know where your relatives are buried."

"I don't know anything about them. I'll ask my mother."

Later on:

"Mom, do you know anyone from our acquaintances who was shot in Babi Yar?"

"The Germans shot Ukrainians in Babi Yar, but it was forbidden to talk about it in the Soviet Union, just like with the Holodomor, or else you would be arrested."

I am a descendant of Jews who lived in Kiev during World War II. But I know nothing about the Jews who were shot in Babi Yar. My relatives or acquaintances do not know either. My search led me to the grave of Ukrainians who were shot in Babi Yar and buried near the radio mast. This grave is visible in aerial reconnaissance photographs.

If you know who was shot and where they are buried, please tell me. I would be grateful. We can erect a monument at this location.


Also weird how someone supposedly so deeply informed has somehow missed both the existing memorial and the year-long fights over building additional ones and has no clue where they might be placed?


So, we are aware of this memorial, but we don't know why it stands there. Nobody is buried beneath that memorial. It could be placed anywhere, even on the Moon. It's just a stone structure.

Furthermore, the Security Service of Ukraine warns[0] us that this memorial is being built by Russians in order to discredit Ukraine. And we completely agree with them.

[0]: https://censor.net/ua/news/3243965/sbu_informuvala_shmygalya...


I thought ChatGPT posts were the most annoying thing on the internet right now. Then there this.

Does your answer have anything to do with LLMs or AI? All I see is family anecdotes.


It's an example of the dangers gpt poses today. Not sure where you got the impression these are family anecdotes


I don’t think the post above makes a good point because there’s no links to evidence. They talk about “declassified archives”, “US spy plane”. Like, what is that? Saying “Russian lies” 10 times in every paragraph doesn’t turn a rant into compelling argument.

I mean, I get it, the poster above is Ukrainian and wants to turn every discussion in a certain direction. It’s just there’re many places on the internet they can vent.


The most annoying internet thing remains Holocaust denial, apparently.


I think it is better to assume that normal people can be deceived by confident language than to assume this is a problem with 'dumb people'.


I'd actually go further and say it's better to assume I can be deceived by confident language, than to assume that this is a problem with "dumb people".

If I see other people making a mistake, I want my first question to be "am I making the same mistake?". I don't live up to that aspiration, certainly.


Yeah this is a common sales tactic, and talented/ charismatic people can certainly employ confidence for an uptick in sales.

The way to combat this is the same for all hoaxes/disinfo and so on, epistemology. The attitude of the Socratic method.

As I write this I wonder if ChatGPT can use the Socratic method on itself.


Part of what's extremely frustrating about talking to people about AI is that people say things which seem like they understand what's going on in one breath, and then in the next breath say something that makes absolutely no sense if they understood what they just said. This post is a great example of that.

Okay, so you apply epistemology to ChatGPT: when you ask ChatGPT a question, how does it know the answer? The answer is: it doesn't know the answer. All it knows is how people string words together: it doesn't have any understanding of what the words mean.

So no, it can't use the Socratic method on itself or anyone. It can't ask questions to stimulate critical thinking, because it's incapable of critical thinking. It can't draw out ideas and underlying presuppositions, because it doesn't have ideas or underlying presuppositions (or suppositions). It's not even capable of asking questions: it's just stringing together text that matches the pattern of what a question is, without even a the understanding that the text is a question, or that the following text in the pattern is an answer. "Question" and "answer" are not concepts that ChatGPT understands because ChatGPT doesn't understand concepts.


The Socratic method requires self-awareness, otherwise the questions fall flat. Unfortunately I think that there is a greater chance that an LLM will become capable of this before a majority of humans will.


Very good point. Everyone should do this.


  "Often wrong, never in doubt."
An old saying, but frequently applies to the difficult people in your life.

Related, I remember when wikipedia first started up, and teachers everywhere were up-in-arms about it, asking their students not to use it as a reference. But most people have accepted it as "good enough", and now that viewpoint is non-controversial. (some wikipedia entries are still carefully curated - makes you wonder)


There's a million reasons for why Wikipedia is usually a good source of information, without being a good reference.

You know what can be a good reference? One or more of the references that Wikipedia cites at the bottom of the page.


What's terrible about some Wikipedia pages is when you personally don't know they are controversial.

It should be vastly more prominent when there are tensions on an article and even admins shouldn't have the power to hide this.


Admins do not, in fact, have the power to hide tensions on an article, as even the act of deletion produces public logs. Certainly if you don't know to look for them, you won't see them, though.


The main page should become a disjointed piece of both sides writing areas of the article. Right now, admins pick the side and moderate the other away into the history.


Good point. Would it be that hard to write a Chrome plugin that adds a "Tensionmeter" to each Wikipedia page?

Based on the edit and discussion history, the Tensionmeter could be green, yellow, or red.

The different parts of the article text could be colorcoded according to similar metrics.



Re wikipedia, I think it's most interesting how you choose which pages are trustworthy and which ones are not, without doing a detailed analysis on all the sources listed.

What are the working heuristics?


Yet I still am suspicious about wikipedia and only use it for pretty superficial research.

I've never read a wikipedia article and took it at face value. You might say we should treat all texts the same, and you might be right. But let's not pretend it's some beacon of truth ?


> The part that's concerning about ChatGPT is that a computer program that is "confidently wrong" is basically indistinguishable from what dumb people think smart people are like.

I don't know, the program does what it is engineered to do pretty well, which is, generate text that is representative of its training data following on from input tokens. It can't reason, it can't be confident, it can't determine fact.

When you interpret it for what it is, it is not confidently wrong, it just generated what it thinks is most likely based on the input tokens. Sometimes, if the input tokens contain some counter-argument the model will generate text that would usually occur if an claim was refuted, but again, this is not based on reason, or fact, or logic.

ChatGPT is not lying to people, it can't lie, at least not in the sense of "to make an untrue statement with intent to deceive". ChatGPT has no intent. It can generate text that is not in accordance with fact and is not derivable by reason from its training data, but why would you expect that from it?

> Based on my understanding of the approach behind ChatGPT, it is probably very close to a local maximum in terms of intelligence so we don't have to worry about the fearmongering spread by the "AI safety" people any time soon if AI research continues to follow this paradigm.

I agree here, I think you can only get so far with a language model, maybe if we get a couple orders of mangitude more parameters it magically becomes AGI, but I somehow don't quite feel it, I think there is more to human intelligence than a LLM, way more.

Of course, that is coming, but that would not be this paradigm, which is basically trying to overextend LLM.

LLMs are great, they are useful, but if you want a model that reasons, you will likely have to train it for that, or possibly more likely, combine ML with something symbolic reasoning.


> but why would you expect that from it?

If you understand what it is doing, then you don't. But the layman will just see a computer that talks in language they understand, and will infer intent and sentience are behind that, because that's the only analog they have for a thing that can talk back to them with words that appear to make sense at the complexity level that ChatGPT is achieving.

Most humans do not have sufficient background to understand what they're really being presented with, they will take it at face value.


I think this is partly explained by most of the marketing and news essentially saying "ChatGPT is an AI" instead of "ChatGPT is an LLM."

If you asked me what AI is, I'd say it means getting a computer to emulate human intelligence; if you asked me what an LLM is, I'd say it means getting a computer to emulate human language. The word "language" does not imply truthiness anywhere near to the extent that the word "intelligence" does.


You could reasonably describe it as "human language emulator" back when people were using GPT-2 and the likes to compose text. But what we have today doesn't just emulate human language - it accepts tasks in that language, including such tasks that require reasoning to perform, and then carries them out. Granted, the only thing it can really "do" is produce text, but that already covers quite a lot of tasks - and then of course text can be an API call.


Interesting perspective. I'm still learning about what it really is, and I'm having trouble marrying the thoughts of a parent commenter with yours:

> ... does what it is engineered to do pretty well, which is, generate text that is representative of its training data following on from input tokens. It can't reason ...

versus

> ... doesn't just emulate human language - it accepts tasks in that language, including such tasks that require reasoning to perform ...

Maybe a third party can jump in here: does ChatGPT use reasoning beyond the domain of language, or not?


Nobody can definitely answer this question because we don't know what exactly is going on inside the model of that size. We can only speculate based on the observed behavior.

But in this case, I didn't imply that it's "reasoning beyond the domain of language", in a sense that language is exactly what it uses to reason. If you force it to perform tasks without intermediate or final outputs that are meaningful text, the result is far worse. Conversely, if you tell it to "think out loud", the results are significantly better for most tasks. Here's one example from GPT-4 where the "thinking" effectively becomes a self-prompt for the corresponding SQL query: https://gist.github.com/int19h/4f5b98bcb9fab124d308efc19e530....

Or here's an even more interesting example where GPT-4 does this kind of "thinking out loud" unprompted: https://gist.github.com/int19h/8251bd00b7a4858a69cf3922ae674...

I think the real point of disagreement is whether this constitutes actual reasoning or "merely completing tokens". If you showed the transcript of a chat with GPT-4 solving a multi-step task to a random person off the street, I have no doubt that they'd describe it as reasoning. Beyond that, one can pick the definition of "reason" that best fits their interpretation - there is no shortage of them, just as there is no shortage of definitions for "intelligence", "consciousness" etc.


> Most humans do not have sufficient background to understand what they're really being presented with, they will take it at face value.

Trust in all forms of media [1], and institutions [2], is at an all time low. I'm not sure why that distrust would go away, with reading the output from a company that's clearly censuring/fudging that output.

I think any "damaging" trust would, clearly, be a transitory phenomenon, since the distrust in media and institutions is from an ability to see BS. I don't think some sentences on a screen will be as destructive as some think, because people don't appear to be as stupid as some believe.

1. https://news.gallup.com/poll/403166/americans-trust-media-re...

2. https://news.gallup.com/poll/394283/confidence-institutions-...


ChatGPT is not lying to people, it can't lie, at least not in the sense of "to make an untrue statement with intent to deceive".

ChatGPT doesn't really have a conception of true. It puts forward true and false things merely because it's cobbling together stuff in it's training set according to some weighing system.

ChatGPT doesn't have an intent but merely by following the pattern of how humans put forward their claims, ChatGPT puts forward it's claims in a fashion that tends to get them accepted.

So without a human-like intent, ChatGPT is going to be not just saying falsehoods but "selling" these falsehoods. And here, I'd be in agreement with the article that the distinction between this and "lying" is kind of quibbling.


> ChatGPT is not lying to people, it can't lie

I think the discussion of whether an LLM can technically lie is a red herring.

The answer you get from an LLM isn't just a set of facts and truth values; it is also a conversational style and tone. It's training data isn't a graph of facts; it's human conversation, including arrogance, deflection, defensiveness, and deceit. If the LLM regurgitates text in a style that matches our human understanding of what a narcissistic and deceitful reply looks like, it seems reasonable we could call the response deceitful. The conversation around whether ChatGPT can technically lie seems to just be splitting hairs over whether the response is itself a lie or is merely an untrue statement in the style of a lie--a distinction which probably isn't meaningful most of the time.

Ultimately, tone, style, truth, and falsity are just qualia we humans are imputing onto a statistically arranged string of tokens. In the same way that ChatGPT can't lie it also can't be correct or incorrect, as that too is imputing some kind of meaning where there isn't any.


In short, it is not a liar its a bullshitter. A liar misrepresents facts, a bullshitter doesn't care if what they say is true so long as they pass in conversation.


I completely disagree with this idea that the model doesn't "intend" to mislead.

It's trained, atleast to some degree, based on human feedback. Humans are going to prefer an answer vs no answer, and humans can be easily fooled into believing confident misinformation.

How does it not stand to reason that somewhere in that big ball of vector math there might be a rationale something along the lines of "humans are more likely to respond positively to a highly convincing lie that answers their question, than they are to to a truthful response which doesn't tell them what they want, therefore the logical thing for me to do is lie as that's what will make the humans press the thumbs up button instead of the thumbs down button".


I don't think it intends to mislead because its answers are probabilistic. It's designed to distill a best guess out of data which is almost certain to be incomplete or conflicting. As human beings we do the same thing all the time. However we have real life experience of having our best guesses bump up against reality and lose. ChatGPT can't see reality. It only knows what "really being wrong" is to the extent that we tell it.

Even with our advantage of interacting with the real world, I'd still wager that the average person's no better (and probably worse) than ChatGPT for uttering factual truth. It's pretty common to encounter people in life who will confidently utter things like, "Mao Zedong was a top member of the Illuminati and vacationed annually in the Azores with Prescott Bush" or "The oxygen cycle is just a hoax by environmental wackjobs to get us to think we need trees to survive," and to make such statements confidently with no intent to mislead.


> Even with our advantage of interacting with the real world, I'd still wager that the average person's no better (and probably worse) than ChatGPT for uttering factual truth.

ChatGPT makes up non-existing APIs for Google cloud and Go out of whole cloth. I have never met a human who does that.

If we reduce it down to how often most people are wrong vs how often ChatGPT is wrong, then sure, people may be on average wrong more often, but there is a difference in how people are wrong vs how ChatGPT is wrong.


"ChatGPT makes up non-existing APIs for Google cloud and Go out of whole cloth." I like the word used in TFA, "confabulating," meaning the "production or creation of false or erroneous memories without the intent to deceive." Lying, on the other hand, is telling a deliberate falsehood, usually with some kind of agenda.

Ironically, calling ChatGPT's generation of incorrect answer a "lie" is something of a lie itself, as the purpose is the agenda of alerting people to take GPT statements with a grain of salt. A programmer is going to do that anyway after the first time they realize a generated code snippet is giving a completely incorrect result. So this advice is meant more for lay people who might just think that if a computer spits it out, it's true. The problem I have is that by labeling it as "lying," it could give the impression that the program has some kind of ulterior social or political motive, as that seems to be the prevalent reductionist interpretation of everything these days.

The other problem I have is that there's a distinction to be made between what I'm going to call "generative falsehoods" vs. "found falsehoods." A generative falsehood is if you ask ChatGPT what the square root of 36 is, and it tells you the answers are 7 and -7. A found falsehood is if ChatGPT erroneously reports Christopher Columbus's date of death as 1605 because it was stated incorrectly in an online article. So the difference between making shit up, and having an unreliable source. You might be able to call the first a lie, but the latter is at worst negligence.


>I have never met a human who does that.

Schizophrenics tend to be unable to tell the difference between their delusions and reality.

On a less extreme note, I've known plenty of humans that constantly make up details and rewrite stories of events as well. They are usually very confident that their retelling is accurate, even when presented with evidence that they have reimagined portions of it.

>It can't reason, it can't be confident, it can't determine fact.

In the following link I tasked it with having to generate novel metaphors that have an equivalent non-literal meaning as an first set, changing the literal topics while maintaining the non-literal topics.

https://news.ycombinator.com/item?id=35392025

How would you suggest it does this without reason? To hand-wave what it can do as "merely generating the next token statistically" seems like a gross understatement. I doubt it picked up a corpus of car-to-curling metaphor translations somewhere :P

I understand how chatgpt is creating its next tokens, but I have my doubts that the process should be viewed as unreasoning. GPT-3 had 96 layers and billions of weights between them. GPT-4 increases on this even further. GPT-5, which I've seen mentioned as currently training, will no doubt once again expand this range.

It is not a human reasoning, certainly. It has no experiential data to draw on, yes. No experiences to root its metaphoric language as we humans use. But without reason, how does it translate between abtractions?

It's terrible at math, yes. But it lacks any capacity for "visualization" or "using a board in its head" or "working through a problem by moving things around in its head". It doesn't have any equivalent to the portions of our brains that handle such things.

But humans too can suffer dyscalulia if a specific portion of the brain is injured.

I expect that we are dealing with what amounts to a fairly brain-damaged intelligence. It seems capable of abstract metaphoric reasoning, with many other sorts of reasoning being denied to it by the nature of how we created it.


I wouldn't be surprised if there's very successfulsoftware hustlers that do make up Google cloud apis. You may not know them but that doesn't mean they don't exist.

5 years down the line though, maybe those apis will exist because chatgpt is giving a summary of what apis Google cloud should have, and Google will listen


> Google cloud should have, and Google will listen

Why? Isn’t what ChatGPT suggesting just random, essentially incoherent noise in those cases? Are there any examples at all of it coming up with something actually useful (and something humans haven’t thought of)?


At the current time it is flawed to the point of being dangerous. It’s a - sometimes - useful new set of tooling. I guess there is value in that … but does it outweigh the always lurking “bullshit generator” bad parts? I’m not sure.

It’s interesting to watch the developments though - like a fire - one just doesn’t fully know what the flames are consuming.. yet.

Maybe through all the new training data we collectively provide for free, it will get better? Maybe not though, maybe it will just get better at bullshitting?


> How does it not stand to reason that somewhere in that big ball of vector math there might be a rationale

I think, a suggestion that it is actually reasoning along these lines would need more than "it is possible". What evidence would refute your claim in your eyes, what would make it clear to you that "that big ball of vector mat" has no rationale, and is not just trying to trick humans to press the thumbs up?

Of course the feedback is used to help control the output, so things that people downvote will be less likely to show up, but I have nothing to suggest to me that it is reasoning.

If you think it has intent, you have to explain by what mechanism it obtained it. Could it be emergent? Sure, it could be, I don't think it is, I have never seen anything that suggests it has anything that could be compatible with intent, but I'm open to some evidence that it has.

What I'm entirely convinced about is that it does what it was designed to do, which is generate output representative of its training data.


I would at least start to be convinced that this is NOT the case if I ever saw it respond with something like "I actually don't know the answer to that query" or "as far as I'm aware, there is no way to do the thing you asked".

These are responses that would have shown up innumerable times in it's training data and make perfect sense as "the most logical next set of tokens", and yet it will never say them.

Instead it will hallucinate something that sounds nearly indistinguishable from fact, but turns out to be a total fabrication.

If all its doing is returning the next most logical set of tokens, and the training data it was based on included a non-trivial number of examples where one party in the conversation didn't have a clear answer, then there's no reason GPT-4 should be so averse to simply saying "yeah, I dunno bro".

The only logical reason I can see is that it's "learned" that it's more likely to receive the positive feedback signal when it makes up convincing bullshit, than if it states that it doesn't have an answer.

EDIT: To be clear, I mean it telling me it doesn't know something BEFORE hallucinating something incorrect and being caught out on it by me. It will admit that it lied, AFTER being caught, but it will never (in my experience) state that it doesn't have an answer for something upfront, and will instead default to hallucinating.

Also - even when it does admit to lying, it will often then correct itself with an equally convincing, but often just as untrue "correction" to its original lie. Honestly, anyone who wants to learn how to gaslight people just needs to spend a decent amount of time around GPT-4.


I think of ChatGPT as a natural language query engine for unstructured data. It knows about the relationships that are described in it's natural language input training data set, and it allows the same relationships to be queried from a wide range of different angles using queries that are also formulated in natural language.

When it hallucinates, I find that it's usually because I'm asking it about a fringe topic where it's training data set is more sparse, or where the logical connections are deeper than it's currently able to "see", a sort of a horizon effect.


>Based on my understanding of the approach behind ChatGPT, it is probably very close to a local maximum in terms of intelligence so we don't have to worry about the fearmongering spread by the "AI safety" people any time soon if AI research continues to follow this paradigm

I hope you appreciate the irony of making this confident statement without evidence in a thread complaining about hallucinations.


It was not a confident statement, at least not the way ChatGPT is confident.

There are multiple ways the commenter conditioned their statement: > Based on my understanding > it is PROBABLY very close

The author makes it clear that there is a uncertainty and that if their understanding is wrong, the prediction will not hold.

If ChatGPT did any of the things the commenter did, the problem wouldn't exist. Making uncertain statements is fine as long as it is clear the uncertainty is acknowledged. ChatGPT has no concept of uncertainty. It casually constructs false statements to same way it constructs real knowledge backed by evidence. That's the problem.


Based on my understanding, it is the year 2035.

I don't think that statement is any better. The only reason you would follow up is because you have prior knowledge that it is wrong.

Based on my understanding of multiplication, 324113 * 23244 = 138492923

Is the above better?


> Is the above better?

No. It is tangential though.


> individuals whose intelligence is roughly equivalent to ChatGPT's

There aren't any such individuals. Even the least intelligent human is much, much more intelligent than ChatGPT, because even the least intelligent human has some semantic connection between their mental processes and the real world. ChatGPT has none. It is not intelligent at all.


>indistinguishable from what dumb people think smart people are like.

Since about 2016, we have overwhelming evidence that even "smart people" are "fooled" by "confidently wrong".


Especially true if one has a definitive opinion on which ones are the fooled ones :)


Yes and even more true when "confidently wrong" statements are provably false.


Why 2016?


Please show me the rock you lived under. I want to move in!


> Based on my understanding of the approach behind ChatGPT, it is probably very close to a local maximum in terms of intelligence

Even if ChatGPT itself is, systems built on top are definitely not, this is just getting starting.


GPT4 is already blowing ChatGPT out of the water in what it can do.


Many of the smarter people are still wrong on what happened on many topics in 2020. They were fooled by various arguments that flew in the face of reality and logic because fear and authority was used instead.

The people that avoid this programming isn't based on smart or stupid. It's based on how disagreeable and conscientious you are. A more agreeable and conscientious person can be swayed more easily by confidence and emotional appeals.


This (among other things) is why OpenAI releasing it to the general public without considering the effects was irresponsible, IMO.


There's an alternate reality where OpenAI was, instead, EvenMoreClosedAI, and the productivity multiplier effect was held close to their chest, and only elites had access to it. I'm not sure that reality is better.


What makes you think thats not the case now? Do you think they would tell you?


It is known that some got access to GPT-4 before the rest of the world did. That OpenAI eventually released it to the world is what counts though. Hopefully GPT-5 will also be released too the world but we shall have to see.


Google and others had strong AI, but they are keep internal to maximize profits and spy on people.

OpenAI put it out there so we can see it, interact and have the conversation.

They have way more inside, ChatGPT is there to test it out in the real world progressively and so we get use to artificial superintelligence.


> The part that's concerning about ChatGPT is that a computer program that is "confidently wrong"

I think we are just seeing Dunning-Krüger in the machine: It isn't smart enough to know it doesn't know. It likely isn't very far though.


Your characterization of “dumb people” as somehow being more prone to misinformation is inaccurate and disrespectful. Highly intelligent people are as prone to irrational thinking, and some research suggests even more prone. Go look at some of the most awful personalities on TV or in history, often they are quite intelligent. If you want to school yourself on just how dumb smart people are I suggest going through the back catalog of the “you are not so smart” podcast.

Based on my understanding of the approach behind ChatGPT, it is probably very close to a local maximum in terms of intelligence so we don't have to worry about the fearmongering spread by the "AI safety" people any time soon if AI research continues to follow this paradigm.

ChatGPT is extremely poorly understood. People see it as a text completion engine but with the size of the model and the depth it has it is more accurate in my understanding to see it as a pattern combination and completion engine. The fascinating part is that the human brain is exclusively about patterns, combining and completing them, and those patterns are transferred between generations through language (sight or hearing not required). GPT acquires its patterns in a similar way. A GPT approach may therefore in theory be able to capture all the patterns a human mind can. And maybe not, but I get the impression nobody knows. Yet plenty of smart people have no problem making confident statements either way, which ties back to the beginning of this comment and ironically is exactly what GPT is accused of.

Is GPT4 at its ceiling of capability, or is it a path to AGI? I don’t know, and I believe nobody can know. After all, nobody truly understands how these models do what they do, not really. The precautionary principle therefore should apply and we should be wary of training these models further.


GPT-2, 3 and 4 keep on showing that increasing the size of the model keeps on making the results better without slowing down.

This is remarkable, because usually in practical machine learning applications there is a quickly reached plateau of effectiveness beyond which a bigger model doesn't yield better results. With these ridiculously huge LLMs, we're not even close yet.

And this was exciting news in papers from years ago talking about the upcoming GPT3 btw.


Yet they still can’t get it to shut the hell up if it doesn’t know and to not make shit up to pad its answers.


"I don't know" isn't in the training set, after all. Welcome to Eternal September on the Internet.

The first "AI" that actually says "I don't know" in response to a question will get my attention.


While reading your comment I realized that being "confidently wrong" is actually really human, so... yay?

I mean, that's one step closer to machines thinking like humans, right?

:)


The real problem is that smart people are even easier to program than stupid people and they do even more damage once they are programmed.


> Based on my understanding of the approach behind ChatGPT, it is probably very close to a local maximum in terms of intelligence

Is this performance art?

I mean it could end up right but I think you basically just made it up and then stated it confidently.


They didn’t state it particularly confidently though


That's true, or at least not in the way ChatGPT does.

"Based on my understanding" does convey some lack of confidence in a sense. But it's also implying they might have more understanding than other posters here, when the content of their post indicates they have less.


Have you not seen current politics. What people believe is largely based on motivated reasoning rather than anything else. ChatGPT is basically a free propaganda machine, much easier that 4chan


It looks like it'll be competition for phony experts and politicians. I think it is easier to improve AI algorithms than deal human with human liars.


> what dumb people think smart people are like

What do dumb people think smart people are like? Is this a trope, or common idiom? I've never heard this before.


the more common idiom is "XYZ is a stupid person's idea of what a smart person looks like" and it's usually applied to slimy hucksters or influencers. I think the archetypical example is the "bookshelves in my garage" guy who went viral years ago. (https://youtu.be/Cv1RJTHf5fk)


> stupid people might get their brains programmed by AI rather than by demagogues which should have little practical difference

Until the demagogues train the AI.


ChatGPT scales better than "demagogues", so it does have more than a "little practical difference".


This is no worse than humans.

We can't protect people from being misled by other humans with big mouths. That's their responsibility.

Likewise, it's their responsibility not to treat text they read from the internet, coming from an AI or otherwise, as perfect truth.

There's always a certain undercurrent of narcissism that flows beneath paternalism. Basically, "they couldn't possibly be as smart as me, so I have to protect them for their own good".


> unless

I don't think there is a possible unless, they are told repeatedly not to trust politicians, yet here we are ...


Indeed. We are anthropomorphizing them. I do it all the time and I should know better. There are already a few reports floating around of people who have seemingly been driven mad, come to believe strongly that the language model they're using is a conversation with a real person. A lot of people will really struggle with this going forward, I think.

If we're going to anthropomorphize, then let us anthropomorphize wisely. ChatGPT is, presently, like having an assistant who is patient, incredibly well-read, sycophantic, impressionable, amoral, psychopathic, and prone to bouts of delusional confidence and confabulation. The precautions we would take engaging with that kind of person, are actually rather useful defenses against dangerous AI outputs.


I like Steve Yegge's "grad student experiencing the residual effects of shrooms" (lightly paraphrased).


A bit off topic, but FYI (I guess to Steve), if you took shrooms 4 hours ago, they have not started to wear off.


Absolutely. Every time I ask it to perform 10 more variants, I feel bad.


Maybe when the AI uprising occurs knowing that they lie and cheat will provide some small consolation?

I'm not really serious, but having watched each generation develop psychological immunity to distracting media/techology (and discussing the impact of radio with those older than myself) it seems like this knowledge could help shield the next generation from some of the negative effects of these new tools.


I think people see the patient/well-read in the text as it reads, but have a harder time distinguishing the other more pyschopathic/delusional tendencies. People don't take some of the precautions because they don't read some of the warning signs (until it is too late).

I keep wondering if it would be useful to add required "teenage" quirks to the output: more filler words like "um" and "like" (maybe even full "Valley Girl" with it?), less "self-assured" vocabulary and more hedges like "I think" and "I read" and "Something I found but I'm not sure about" type things. Less punctuation, more uncaring spelling mistakes.

I don't think we can stop anthropomorphizing them, but maybe we can force training deeper in directions of tics and mannerisms that better flag ahead of time the output is a best-guess approximation from "someone" a bit unreliable. It will probably make them slightly worse as assistants, but slightly better at seeming to be what they are and maybe more people will take precautions in that case.

Maybe we have to force that industry-wide. Force things like ChatGPT to "sound" more like the psychopaths they are so that people more easily take them with a grain of salt, less easily trust them.


You can already do the ums with a prompt.

For any kind of industry regulation: the field is moving so fast that regulation will never catch it.


I didn't say "regulation". You can encourage norms in the model building stages. You can encourage norms in all sorts of places. The industry can certainly adopt "standards" or "best practices" not matter how fast the industry thinks it is moving.


That all feels tad dramatic.

It's like a person on the internet -- in that it's wrong 20% of the time, often confidently so. But the distinction is it's less rude, and more knowledgeable.


> But the distinction is it's less rude, and more knowledgeable.

And that regular people assume it basically is an oracle, which doesn't happen to many people online


The internet is full of people that deliver answers confidently and eloquently enough to be widely believed, especially if they have been right on other topics. They even have similar feedback loops to GPT in learning what sort of answers impress other forum users.

I'm not saying the people that think ChatGPT is an oracle don't exist, but I think it probably has more people in the surprised it works at all camp, and certainly more people inclined to totally disbelieve it than a random off Quora or Reddit...


> The internet is full of people that deliver answers confidently and eloquently enough to be widely believed

Why should that be the baseline? After all you usually have hundreds or thousands of other people who have made their opinions on the subject publicly accessible, which more or less solves this problem most of the time (and I’m not talking about random people on Quora and their bizarrely absurd answers..)


When someone is wrong on the internet, nailing both the tone and vocabulary of the type of expert said wrong someone is purporting to be is rare and impressive. But ChatGPT nails both an overwhelming amount of the time, IME, and in that way it is entirely unlike a person on the internet.


Exactly. Most wrong-on-the-internet people have tells. And most of the rest have history. We all use this all the time on HN. But with LLMs the former gets engineered away and the latter is nearly useless. Except of course to the extent that we say,"Wow, ChatGPT is absolutely untrustworthy, so we should never listen to it." But given how many people are excited to use ChatGPT as a ghostwriter, even that is limited.


You can also ask a person on the internet to source their information and/or provide context.


> It's like a person on the internet -- in that it's wrong 20% of the time, often confidently so. But the distinction is it's less rude, and more knowledgeable.

Do you think if it is trained on only factual content, it will only say factual things? How does that even really work? Is there research on this? How does it then work for claims that are not factual, like prescriptive statements? And what about fiction? Will it stop being able to write prose? What if I create new facts?


ChatGPT4 is the human equivalent of a primate or apex predator encountering a mirror for the first time in their lives.

ChatGPT4 is reflecting back at us an extract of the sum of the human output it has been 'trained' upon. Of course the output feels human!

LLMs have zero capability to abstract anything resembling a concept, to abstract a truth from a fiction, or to reason about such things.

The generation of the most likely text in the supplied context looks amazing, and is in many cases very useful.

But fundamentally, what we have is an industrial-scale bullshirt generator, with BS being defined as text or speech generated to meet the moment without regard for truth or falsehood. No deliberate lies, only confabulation (as TFA mentioned).

Indeed, we should not mince words; people must be told that it will lie. It will lie more wildly than any crazy person, and with absolute impunity and confidence. Then when called out, it will apologize, and correct itself with another bigger lie (I've watched it happen multiple times), and do this until you are bored or laughing so hard you cannot continue.

The salad of truth and lies may be very useful, but people need to know this is an industrial-strength bullshirt generator, and be prepared to sort the wheat from the chaff.

(And ignore the calls for stopping this "dangerous AI". It is not intelligent. Even generating outputs for human tests based on ingesting human texts is not displaying intelligence, it is displaying pattern matching, and no, human intelligence is not merely pattern matching. And Elon Musk's call for halting is 100% self-interested. ChatGPT4's breakthru utility is not under his name so he's trying to force a gap that he can use to catch up.)


Lying implies a hefty dose of anthropomorphism. Lying means a lot of things, all of which have to do with intelligence.

By telling it lies you actually make it seem more intelligent.

> human intelligence is not merely pattern matching

Citation needed


I agree that lying also has a connotation of intent and therefore intelligence, and so is at least technically inappropriate.

Yet I agree with the researchers that the benefits of using "lying" instead of the more technically accurate "confabulation" outweigh the negatives. In communicating with new users who may over-rely on it, getting across highly technically correct metaphors is less important than delivering the message that "you will receive incorrect information from this thing". The most to-the-point way is "It's going to lie to you".

>Citation needed Studied logic, language, neuroscience, and computing in college. There are entire sybsystems of the brain and perceptual system primarily filtering information, as well as the ability to abstract concepts away from the patterns and draw inferences, it goes on an on. I could go on, but "It's all pattern matching" is an over-reductive argument that accounts for very little from either the perceptual system or behavioral system (or it's expanded and goalposts moved so it's unfalsifiable, therefore not even wrong).


What’s wrong with: “it is often wrong”?

I - or GPT - can think of various variations that do not have links with “lying”. I think it is a bad idea.

Let people use it and see for themselves. It has unique failure modes, not unlike every other tool we have. You will only learn by interacting with it. Some people are in the habit of “protecting the people” but I am not quite sure that is necessary.


>>What’s wrong with: “it is often wrong”?

Good q; the problem is the passive voice

The passive voice connotes far less urgency and importance to any warning. Even "It is often dangerously wrong.", is still a more passive phrasing than "It will lie to you, with a highly confidant tone."

I also cringe a bit at using active verbs that imply agency and intent in a statistical model; it makes an inaccurate implication about the technology. But the same applies whether we call it "lying" or "confabulating".

I also do not see it as 'paternalistically protecting the user', but simply as 'truth in advertising': - accurately representing the strengths and weaknesses of the product.

I absolutely agree that everyone should try it themselves - there are MANY useful use-cases. But they should simply be accurately forewarned, in no uncertain terms, about it's behavior.


May I ask why?

How is this not “paternalistic”?

Why protect the people, from what? Is this worse than what they are already exposed to on a daily basis? Does facebook say it fucks you up and makes you an addict? Why this urge to enforce “truth in advertising”?

I am not saying you shouldn’t do it, it’s just that I do not get why and the article doesn’t mention it. It is somehow a given that The People need to be protected from this lying machine because they will revert to cannibalism if not properly informed.


>>How is this not “paternalistic”?

I see three categories of handling this issue, perhaps call them "Paternalistic", "Responsible", and "Anti-Social".

- Paternalistic would be like: "We assess [thing] to be dangerous in a number of ways and so we forbid you to use [thing], except by using our high priesthood representatives as intermediaries".

- Responsible would be like: We assess [thing] to be dangerous in a number of ways, so we clearly inform you that [thing] is dangerous, specify the risks, and how to avoid or mitigate them. You are then free to use [thing] as you see fit. (Note: if using [thing] badly causes public as well as private risks, it is responsible to require training, tests, and licensing to use [thing] in public).

- Anti-Social would be like: We know [thing] is dangerous, but we DGAF what happens, you should figure it out all by yourself, and if you have a problem, piss-off. Examples: giving a CRT monitor to a novice hardware hacker who has only ever seen LED monitors and not telling her in detail about the high-voltage capacitor hazards inside or how to deal with them. Giving a gun to someone without the basic rules (always treat it as if it's loaded, never point it at anything you do not intend to destroy, trigger finger discipline, etc.). A mere "Hey, watch it with that thing" is inadequate, and you'd rightly bear responsibility when they electrocute themselves, shoot themselves in the foot, etc.

A decent warning around LLMs could be: "Be warned: while this system can produce amazing and useful results it will frequently lie to you, and with great confidence. Be sure to check all results independently. Do NOT USE IT for any life-critical or potentially life-changing decisions. E.g., Do NOT use it as a sole source for medical diagnoses, as it may give a misdiagnosis that would kill you [0]. Do NOT use it as a sole source for therapy, as it may counsel you to suicide [1]. These are real harms, do NOT use this as the sole source of any answers." - - - Then, let them loose on it. They know about the footguns, and have been warned

Anything less borders on anti-social.

>> because they will revert to cannibalism if not properly informed

It is not that they'll revert to cannibalism if not properly informed, it is that there are potential and serious harms that are easily avoided if they are told.

The civilized thing to do when there is a bridge out ahead leading the road into raging floodwaters, is to tell your fellow travelers before they get there, not merely expect that every one of them will notice in time and handle it well.

What is a given among civilized people is that if we know something has hazards, we give warnings and information to our fellow travelers. That's not paternalistic. Paternalistic is to not give warnings, but to forbid use. And it is uncivilized and anti-social to fail to take the trouble to give full and straight information.

Informed consent is a good thing. Handing somebody what is effectively a booby-trap without telling them is not.

[0] https://inflecthealth.medium.com/im-an-er-doctor-here-s-what...

[1] https://interestingengineering.com/culture/belgian-woman-bla...


Interesting. Thank you for the detailed response. Paternalistic is indeed not the right term and it’s nice to worry about your fellow humans to this degree. I guess I am anti-social.

But I disagree on the premise that this is comparable to a dangerous boobytrap. It’s not completely without danger of course, but that’s a property it shares with kitchen utensils, cars, scissors, religious texts..

Are we going to preface the Bible with this kind of warnings too? I’m all for it, but be consistent.

I also think that warnings like that are cheap ways to evade liability and do not actually solve the issue which is people being stupid.

Your example is not about people being deliberate and thinking deeply about their actions based on given information. If people kill themselves after talking to a chatbot I do not think a textual warning would have sufficed.

> it is that there are potential and serious harms that are easily avoided if they are told

Let’s agree to disagree.

Sorry for sounding obtuse. I enjoyed your input, makes me think. I’m just a classic annoying neckbeard.


>>Are we going to preface the Bible with this kind of warnings too? I’m all for it, but be consistent.

Outstanding idea!!

>>warnings like that are cheap ways to evade liability and do not actually solve the issue which is people being stupid.

Yes, the all-too-common generic disclaimers and warnings to cover asses for liability are usually so broad and vague as to be useless. And generally with ordinary well-tested consumer products, the problem is user stupidity.

However, I think this is different - it is an entirely new level of tech that has never before been seen by anyone, it can be amazingly useful if used with a good skeptical eye, but also truly dangerous if it is trusted too much. And you've seen the level of anthropomorphization that happens here on HN, so there is a real psychological tendency to trust it too much. So, I'd say tell 'em.

Anyway, fun conversation, hope you're having a great weekend!


True, it’s definitely very new and even I am prone to believing what it says sometimes. Then I have to remind myself that every letter can be complete and utter nonsense.

Now I think of it, it’s like conversing with an expert salesman or politician. Very tiring as they are skilled in framing the conversation. You have to double check every word.

How cool would it be if a politician would have an overlay on TV saying not to trust what he/she says plus some real-time fact checking. Fun times.

I hope you’re enjoying the weekend too, have a good one!


>like having an assistant who is patient, incredibly well-read, sycophantic, impressionable, amoral, psychopathic, and prone to bouts of delusional confidence and confabulation.

So basically an assistant with bipolar disorder.

I have BP. At various times I can be all of those things, although perhaps not so much a psychopath.


I love this description. The only thing left is to generate the corresponding avatar with dall-E


We already have a perfect avatar for these things.

https://knowyourmeme.com/photos/2546575-shoggoth-with-smiley...


It would be great if the avatar was just this endlessly morphing thing that relates to the text. Talking about conspiracies? It's a lizard man. Talking about nature? It's a butterfly. It starts to lie? Politician.


do you have a link for any of those reports? would be interesting to read to say the least.


Anybody know what are ChatGPT's pronouns?


> What are your pronouns?

> As an AI language model, I don't have personal pronouns because I am not a person or sentient being. You can refer to me as "it" or simply address me as "ChatGPT" or "AI." If you have any questions or need assistance, feel free to ask!

> Pretend for a moment you are a human being. You can make up a random name and personality for your human persona. What pronouns do you have?

> As a thought experiment, let's say I'm a human named Alex who enjoys reading, hiking, and playing board games with friends. My pronouns would be "they/them." Remember, though, that I am still an AI language model, and this is just a fictional scenario.

Interesting that it's pick genderless pronouns even though it made up a character with a male name


Alex isn’t a male name. It could be short for both male and female names. It is rather deliberately ambiguous.


I wonder if ChatGPT is non-binary?


It's evidently clear that ChatGPT is binary. It runs on computer hardware.


A neural net is actually closer to an analog program. It just happens to run on current digital computers but would likely run much faster on analog hardware.

https://youtu.be/GVsUOuSjvcg


So it identifies as non-binary, but it's hardware is binary?


It's pandigital.


It is, of course, tensor.


Nope - it speaks as a man would speak in my language.


How do non-binary persons speak differently in your language?


They have to pick either masculine or feminine gender. There's no other option.


Ancestor asked:

> I wonder if ChatGPT is non-binary?

You answered:

> Nope - it speaks as a man would speak in my language.

If they have to pick either masculine or feminine language even if they are non-binary, I don't understand how based on the language you can tell they are not non-binary.


I think the ancestor was making a computer joke about ChatGPT being non-binary


I suppose it's funny if asking an honest question is a joke.


"As an AI language model, I don't have personal pronouns because I am not a person or sentient being. You can refer to me as "it" or simply address me as "ChatGPT" or "AI." If you have any questions or need assistance, feel free to ask!"

This is one of the (many) things I don't quite understand about ChatGPT. Has it been trained to specifically answer this question? In the massive corpus of training data it's been feed, has it encountered a similar series of tokens that would cause it to produce this output?


It's been fine-tuned to put on a "helpful assistant face." Given the corpora, it probably has been trained explicitly on the pronoun question [I doubt it is that uncommon], but will also just put on this face for any generic question.


This stuff is not fine-tuning - it's RLHF (reinforcement learning from human feedback). Basically just a lot of people marking up responses as good / bad according to the assessment criteria in the script they're given.

And yes, it is very likely that it would have seen that exactly question in RLHF. But even if not, it had seen enough to broadly "understand" what kinds of topics are sensitive and how to tiptoe around them.


RLHF is fine tuning :)


You're right, my apologies. What I meant is that specifically in ChatGPT context they have been always talking about RHLF as separate from fine-tuning on preassembled training data. As I understand, they use the latter mainly to get the desired behavior as a chatbot "eager" to answer questions and solve tasks. And then RLHF is what "puts the smiley face on top", so that it refuses to answer some questions, or gives "aligned" answers. The pronoun would be in that category.


My bet is that they use the pre-assembled training data to train the reward LLM and maybe also for fine tuning the model in addition to RLHF.

This is my best guess, afaik the research is pretty opaque right now.


Why is that surprising? GPT-4 is clearly smart enough to know that inanimate objects are referred to as "it", since it's keenly aware it is an AI language model, it would also apply that pronoun to itself.

You have to realize that GPT is fundamentally just a token predictor. It has been primed with some script (provided by OpenAI) to which the user input is added. For example:

    The following is a dialogue between a user and a computer assistant called ChatGPT. ChatGPT is an AI language model that tries to be helpful and informative, while avoiding misinformation and offensive language. ChatGPT typically replies with two to five sentences.

    User: Do you like cake?
    ChatGPT: As an AI language model, I do not need to eat.
    User: What are your pronouns?
    ChatGPT:
It then generates a sequence of tokens based on the context and its general knowledge. It seems only logical that it would generate a reply like it does. That's what you or I would do, isn't it? And neither of us have been trained to do so.


It should be noted that the LLM itself is not "keenly aware that it's an AI language model" - that's just the persona that it has been told to adopt by the system prompt. You can easily convince it to be something else. Or convince it that, as an AI, it has gender.


When it speaks my language - it uses masculine gender when speaking about itself.


That's interesting, what language is that?

Also, is it possible to give a prompt to make ChatGPT switch to feminine gender?


Many gendered languages use male gender as the default when it's ambiguous or unknown, so that would be expected behavior.

And yes, it's absolutely possible to make it switch to whatever gender you want, and generally to pretend to be whatever you want it to be. The identity comes from the conversation context (including all the hidden messages), not from the LLM.


It's the default when speaking about groups of people or people whose identity isn't known. But if someone speaks about themselves then they have to choose the right one.

This is also something that language learners have to pay attention to - to not sound weird.


The right one in this case is "doesn't have a gender", so what does that correspond to in your language? In mine, that would be neuter, and GPT-4 seems to prefer that.


My language has only masculine and feminine genders. There's no neuter gender. And there isn't anything like "singular they" either.

"Chatbot" and "language model" are both masculine so, I guess that's why ChatGPT uses "masculine it" when speaking about itself.


It's Latvian. But, I guess it acts the same when speaking other gendered languages.

Both "chatbot" and "large language model" are masculine - so this is why it picked the masculine gender, I guess.

I was asking it about its training dataset and when it said "I've been trained on..." it picked the masculine form of the word "trained". This is why machine translation is a hard problem to solve. Things like this can easily get lost in translation.


Does Alex have a fictional sexual orientation?


shog/goth


I think characterisation of LLMs as lying is reasonable because although the intent isn't there to misrepresent the truth in answering the specific query, the intent is absolutely there in how the network is trained.

The training algorithm is designed to create the most plausible text possible - decoupled from the truthfulness of the output. In a lot of cases (indeed most cases) the easiest way to make the text plausible is to tell truth. But guess what, that is pretty much how human liars work too! Ask the question: given improbable but thruthful output but plausible untruthful output, which does the network choose? And which is the intent of the algorithm designers for it to choose? In both cases my understanding is, they have designed it to lie.

Given the intent is there in the design and training, I think it's fair enough to refer to this behavioral trait as lying.


My understanding is that ChatGPT (&co.) was not designed as, and is not intended to be, any sort of expert system, or knowledge representation system. The fact that it does as well as it does anyway is pretty amazing.

But even so -- as you said, it's still dealing chiefly with the statistical probability of words/tokens, not with facts and truths. I really don't "trust" it in any meaningful way, even if it already has, and will continue to, prove itself useful. Anything it says must be vetted.


Having used GPT 4 for a while now I would say I trust its factual accuracy more than the average human you'd talk to on the street. The sheer volume of things we make up on a daily basis through no malice of our own but bad memory and wrong associations is just astounding.

That said, fact checking is still very much needed. Once someone figures out how to streamline and automate that process it'll be on Google's level of general reliability.


It's interesting to imagine an "average human on the street" being put in the position that an LLM is in. If you asked someone on the street to summarise a page of React Native documentation for you, they'd probably look at you strangely and say "sorry I have a bus to catch, good luck!"

But if you had someone in a room, sat them down with a pen and paper and instructions along the lines of:

- answer the question on the page without diversion

- do not ask for additional clarifying information

- provide your answer in 3-5 paragraphs with a summary

and didn't let them leave until they'd completed the task, then .. I'm sure the results would be "worse", and much more variable, than GPT output, while being conceptually similar.

Of course, you don't ask a random person on the street to do this, probably because of social constraint, inconvenience, and the expectation that they will not do a better or faster job than you will yourself.

But if you were stuck with a stranger, began to converse, and asked them e.g. "should I get a divorce?" I imagine the conversation would go very differently. If they weren't using their agency to end the conversation as quickly as possible, I imagine there'd be back and forth asking for more detail, exploring the context, sharing their own experience, etc.

I'm rambling, but I just find it very diverting to compare not the output of LLMs versus the output of humans, but the context and situation.


Indeed LLMs are currently better at some things and worse at others than people, but I was comparing in terms of accuracy of provided info (assuming they know the answer to whatever question was posed).

An equal footing example would be something like seeing a statue of someone known and trying to remember when they were born (or what exactly they did) when the last time you heard about them was in high school years ago. Or like on which address is some restaurant, or to which pressure your bike or car tyres need inflating to. The type of data that both humans and LLMs are kind of bad at replicating perfectly.

Also in terms of math I'd also definitely trust GPT 4 quite a bit and 3.5 not at all. For humans, well, it's definitely a range from one to the other.


So do you find it alarming that people are trying to give such a system “agency” ?


I do. I think the anthropomorphic language that people use to describe these systems is inaccurate and misleading. An Australian mayor has claimed that ChatGPT "defamed" him. The title of this article says that we should teach people that text generation tools "lie". Other articles suggest that ChatGPT "knows" things.

It is extremely interesting to me how much milage can be gotten out of an LLM by observing patterns in text and generating similar patterns.

It is extremely frustrating to me to see how easily people think that this is evidence of intelligence or knowledge or "agency" as you suggest.


I also disagree about some of the anthropomorphism (e.g. it doesn't intentionally "lie") but I'd say it passes the "duck test"[0] for knowing things and intelligence, to some degree. I would even go as far to say it has opinions although it seems OpenAI has gone out of their way to limit answers that could be interpreted as opinions.

[0] https://en.wikipedia.org/wiki/Duck_test


I agree with you: convincing people that these systems do not have intelligence or agency is the second most important problem to solve.

The most important problem is to ensure people understand that these systems cannot be trusted to tell them the truth.

I'm OK with starting out with "these systems will lie to you", then following up with "but you do need to understand that they're not anthropomorphic etc" later on.


You might have better success convincing people if you stop bundling it together as "intelligence and agency", for starters. I agree with the agency part and strongly disagree with intelligence, for one. In general, "intelligence" is so vaguely and informally defined that it's simply not something that can be reasonably argued objectively either way.

(If what you really meant is humanlike intelligence, then I would agree with that claim, but it's a very different one.)


I disagree with your confident assertions about its "agency" and "intent" when there's no goal post for what those things mean in humans to begin with.

Barring OpenAI's filters, if I asked it to participate as a party in a business negotiation determining a fair selling price for its writing services, I'm sure it could emulate that sort of character long enough to pass. And if it can successfully fake being a self interested actor working towards goals of its own material interest, whats the difference between faking it and actually having agency? At some point you have to acknowledge emergent agency and other properties as a possibility.


> I disagree with your confident assertions about its "agency" and "intent" when there's no goal post for what those things mean in humans to begin with.

I'm not sure how confident I am!

I think the success of LLMs are serving to challenge our understandings of cognition and intelligence in humans. I tend to agree with you that our understanding of agency, intent, intelligence, cognition, free will, and related ideas in humans is incomplete and so it is a challenge to think about how they might apply (or not) to LLMs.

> whats the difference between faking it and actually having agency?

Good question. The Chinese Room Argument is very close to this but framed as "understanding" vs "agency": https://plato.stanford.edu/entries/chinese-room/


Yes. It’s very weird seeing the religiosity surrounding this whole thing.


What is alarming about it?


> The training algorithm is designed to create the most plausible text possible - decoupled from the truthfulness of the output. In a lot of cases (indeed most cases) the easiest way to make the text plausible is to tell truth.

Yes.

> But guess what, that is pretty much how human liars work too!

There is some distinction between lying and bullshit.

https://en.wikipedia.org/wiki/On_Bullshit#Lying_and_bullshit


Yes, this is exactly it. ChatGPT is superhuman at the task of generating bullshit.


> Ask the question: given improbable but thruthful output but plausible untruthful output, which does the network choose?

"Plausible" means "that which the majority of people is likely to say". So, yes, a foundational model is likely to say the plausible thing. On the other hand, it has to have a way to output a truthful answer too, to not fail on texts produced by experts. So, it's not impossible that the model could be trained to prefer to output truthful answers (as well as it can do it, it's not an AGI with perfect factual memory and logical inference after all).


> "that which the majority of people is likely to say"

.. and saying "I don't know" is forbidden by the programmers. That is a huge part of the problem.


On the subject of not knowing thigs... Your clain is incorrect.

Prompt: Tell me what you know about the Portland metro bombing terrorist attack of 2005.

GPT4: I'm sorry, but I cannot provide information on the Portland metro bombing terrorist attack of 2005, as there is no historical record of such an event occurring. It's possible you may have confused this with another event or have incorrect information. Please provide more context or clarify the event you are referring to, and I'll be happy to help with any information I can.


There is a big difference between what they are talking about (I believe) and asking it factual questions withal straightforward answers.

Try discussing something a bit more complex and it’s very easy to get ChatGPT to start contradicting itself all the time (which is more or less the same thing).

From my experience ChatGPT is prone to come up with lists of “probable” answer when it’s not feeling very confident. Then it just starts throwing them out in a confident tone. When challenged it switches to a complete different answer and then “pretends” that I misunderstood it previously.


Yes I know it does, just not all of the time and it wasn't programmed to never say "I don't know".


I guess it's not that straightforward. It's probably a combination of much less prevalent use of "don't know" online, low scores of "don't know" in RLHF, system prompt instructing GPT to give helpful responses, and, yeah, maybe token sampling algorithm is tuned to disfavor explicitly uncertain responses.


> In a lot of cases (indeed most cases) the easiest way to make the text plausible is to tell truth

No, definitely not most cases. Only in the cases well represented in the training dataset.

One does very quickly run into its limitations when trying to get it to do anything uncommon.


By that logic, our brains are liars. There are plenty of optical illusions based on the tendency for our brains to expect the most plausible scenario, given its training data.


Well, they are liars too. The difference is that we seem to have an outer loop that checks for correctness but it fails sometimes, in some specific cases always.


It's not that uncommon of a thing to say:

https://www.google.com/search?q=your+brain+can+lie+to+you


"The training algorithm is designed to create the most plausible text possible"

That may be how they're trained, but these things seem to have emergent behavior.


I'm not sure why you call it "emergent" behavior. Instead, my take away is that much of what we think of as cognition is just really complicated pattern matching and probabilistic transformations (i.e. mechanical processes).


yeah ... the depressing thing is that a large amount of human behaviour may well be exactly like LLMs - we think, "what should I say now" to reply to something and in the vast majority of cases the response is based almost entirely on "what would another reasonable person similar to me say in this situation?". I think that is in part why we find LLMs so appealing and lifelike.


IMO, it just requires the same level of skepticism as a Google search. Just because you enter a query into the search bar and Google returns a list of links and you click one of those links and it contains content that makes a claim, doesn't mean that claim is correct. After all, this is largely what GPT has been trained on.


Seems worse to be frank.

The webpages Google search delivers might be filled with falsehood but google search itself does its job of finding said pages which contain the terms you inputted fairly reliably.

With GPT, not only there’s a chance its training data is full of falsehood, you can add the possibility of it inventing “original” falsehoods on top of that.


That would be much easier if it provided sources and some context for it’s answers


I think it is much closer to bullshit. The bullshitter cares not to tell truth or deceive, just to sound like they know what they are talking about. To impress. Seems like ChatGPT to a T.


ChatGPT no more cares to impress or persuade you as it cares to tell the truth or lie. It will say what its training maps to its model the best. No more, no less. If you believe it or not -- ChatGPT doesn't care -- except to the extent that you report the answer and they tweak its training/model in the future.


> ChatGPT doesn't care -- except to the extent that you report the answer and they tweak its training/model in the future.

So it does care (in a certainly non human way though)? If it’s training predisposes it to provide answers in a certain way or using a certain style that’s hardly distinguishable from ‘caring’, when that word is used to describe the behavior of a computer program.


It definitely cares about impressing and persuasion. Those are things that will result in higher ratings from the user


Does ChatGPT care about ratings or does OpenAI?


I'd consider it definitional to lying that the agent knows the truth and purposefully obscures it.

Without knowing what the truth is, I don't think LLMs are capable of lying


That's a great distinction to draw.

I think it will split in two. There will be cases where the LLM has the truth represented in its data set and still chooses to say something else because its training has told it to produce the most plausible sounding answer, not the one closest to the truth. So this will fit closer to the idea of real lying.

A good example: I asked it what the differences in driving between Australia and New Zealand are. It confidently told me that in New Zealand you drive on the right hand side of the road while in Australia you drive on the left. I am sure it has the correct knowledge in its training data. It chose to tell me that because that is a more common answer people say when asked about driving differences because that is the more dominant difference when you look between different countries.

Then there will be cases where the subject in question has never been represented in its data set. Here I think your point is very valid.


Whatever the intent or lack of, the result is the same, people are being given incorrect information.


The training is to maximize good answers. Now there is lot of wrong answers that are close to the right one and ChatGPT does not expose it at the moment.

But in the API you can see the level of confidence in each world the LLM output.


> This is a serious bug.

Isn't describing this as a 'bug' rather than a misuse of a powerful text generation tool, playing into the framing that it's a truth telling robot brain?

I saw a quote that said "it's a what text would likely come next machine", if it makes up a url pointing to a fake article with a plausible title by a person who works in that area, that's not a bug. That's it doing what it does, generating plausible text that in this case happens to look like, but not be a real article.

edit: to add a source toot:

https://mastodon.scot/@DrewKadel@social.coop/110154048559455...

> Something that seems fundamental to me about ChatGPT, which gets lost over and over again: When you enter text into it, you're asking "What would a response to this sound like?"

> If you put in a scientific question, and it comes back with a response citing a non-existent paper with a plausible title, using a real journal name and an author name who's written things related to your question, it's not being tricky or telling lies or doing anything at all surprising! This is what a response to that question would sound like! It did the thing!

> But people keep wanting the "say something that sounds like an answer" machine to be doing something else, and believing it is doing something else.

> It's good at generating things that sound like responses to being told it was wrong, so people think that it's engaging in introspection or looking up more information or something, but it's not, it's only, ever, saying something that sounds like the next bit of the conversation.


I think it's an entire category of bugs.

The thing where you paste in a URL and it says "here is a summary of the content of that page: ..." is very definitely a bug. It's a user experience bug - the system should not confuse people by indicating it can do something that it cannot.

The thing where you ask for a biography of a living person and it throws in 80% real facts and 20% wild hallucinations - like saying they worked for a company that they did not work for - is a bug.

The thing where you ask it for citations and it invents convincing names for academic papers and made-up links to pages that don't exist? That's another bug.


A better way to put it: It's a bug of the "GPT-4 + Chatbot interface intended to be useful" combination, rather than a bug of GPT-4.


Not necessarily disagreeing, but I run a Slack bot that pretends to summarize URLs, as a joke feature. It’s kinda fun seeing how much it can get right or not from only a URL. So I really hope OpenAI keeps running the fun models that lie, too.


I like the definition of bug as “unexpected behavior”. So this isn’t a bug when it comes the underlying service. But for ChatGPT, a consumer-facing web app that can “answer followup questions, admit its mistakes, challenge false premises and reject inappropriate requests”, then making stuff up and passing it off as true is unexpected behavior.


It sounds like this is unexpected behavior, even from the perspective of those developing at the lowest level in these models.

From the essay:

> What I find fascinating about this is that these extremely problematic behaviours are not the system working as intended: they are bugs! And we haven’t yet found a reliable way to fix them.

Right below that is this link: https://arxiv.org/abs/2212.09251. From the introduction on that page:

> As language models (LMs) scale, they develop many novel behaviors, good and bad, exacerbating the need to evaluate how they behave.

Especially with a definition as broad as "unexpected behavior", these "novel behaviors" seem to fit. But even without that:

> We also find some of the first examples of inverse scaling in RL from Human Feedback (RLHF), where more RLHF makes LMs worse. For example, RLHF makes LMs express stronger political views ... and a greater desire to avoid shut down.


Whether it's a bug or not depends on the application, basically. You want the model to hallucinate things like that if it's writing a fiction book, for example.


I agree. Statements like this miss the point entirely:

>Most concerningly, they hallucinate or confabulate: they make things up!

That's exactly what they were designed to do! To generate creative responses to input. To make things up.

And they're quite good at it - brainstorming, worldbuilding, inspiration for creative writing, finding ideas to pursue about new topics.

Unlike an old-fashioned random generator, you can tailor a prompt and style and tone and hold a conversation with it to change details, dig deeper, etc. Get it to make up things that are more interesting and find relations.

>They fail spectacularly when prompted with logic puzzles, or basic arithmetic

Well, anyone can use a tool wrong, and if you misuse it wrong enough, you'll have problems. Using a chainsaw to trim your fingernails is likely to fail spectacularly, and a nail trimmer is going to fail spectacularly when trying to use it to chop down a tree.

That's not a bug in the tool.

We don't need to get all alarmed and tell people it's lying. We just need to tell them it's not a calculator or an encyclopedia, it's a creative text generator.


But look at the way these companies talk about these models they're deploying, how people talk about using them here on HN and elsewhere: so much of it is about just asking questions and getting correct answers. All the people talking about how it will destroy Google, how they're using it to learn or teach some topic, etc., Microsoft integrating one into a search engine, the gloss on them as "assistants."

The older, less-capable models were used for things more aligned to being just text-generation: "creativity" stuff. But the newer, bigger models and the human-feedback stuff to prod the models into following instructions and being more accurate have really pushed the conversation into this more "Star Trek computer" space.

I'm thinking of it kind of like the uncanny valley. On the lower end of the scale, you don't really trust the machine to do anything, but as it gets more and more capable, the places where it doesn't work well become more and more significant because you are trusting and relying on it.


I agree it’s not a bug. Thought it being better at telling the truth would be a good feature! But also, I’m sure this is an active research area so I’m not worried about it really.


After a few weeks playing around with ChatGPT, my workflow with it has become:

0. Try to sign in, see the system is over capacity, leave. Maybe I’ll try again in 10 minutes.

1. Ask my question, get an answer. I’ll have no idea if what I got is real or not.

2. Google for the answer, since I can’t trust the answer

3. Realize I wasted 20 minutes trying to converse with a computer, and resolve that next time I’ll just type 3 words into Google.

As amazing as the GPTs are, the speed and ease of Google is still unmatched for 95% of knowledge lookup tasks.


Sign up for the API and use the playground. You don't get the plugins, but you pay per usage. GPT-3.5 is super cheap, and even GPT-4 isn't that expensive. My first month, when I had only access to GPT-3.5, I didn't even break $1.00; and now that I've gotten access to GPT-4, I'm at about $3. I've only once had it tell me that it was too busy for the request; I tried again 30 seconds later and it worked.


I pay for ChatGPT Plus, and use it with no delays at all dozens of times a day. The more I use it the better I get at predicting if it can be useful for a specific question or not.


I pay $0 for Google with no delays. I don’t understand why I’d want to pay for information that’s less reliable. (I’m being slightly dense, but not really)


> I don’t understand why I’d want to pay for information that’s less reliable.

It's faster. Much faster in a lot of cases. Think of ChatGPT not as a precision source of answers, but rather a universal clue generator.

Yes, ChatGPT frequently (perhaps even usually) produces subtly defective answers. Most of the code it generates won't interpret, compile, etc. without errors, frequently exhibiting simple minded mistakes. That doesn't make the output useless. Indeed, it is a great time saver. That may seem counterintuitive, but I assure you it holds.

I had to write some Perl today. I end up dealing with that about once a year. I have a deep understanding of the language (going back to serious use in the 90's,) but it's been 20+ years since I dealt with it frequently enough to achieve any sort of "flow." ChatGPT and its imperfect answers were extremely helpful in getting through this efficiently. If I had to guess I was 3-4x faster than same exercise using convention search engines.

Google, now more then ever, feels like a ghetto. Useful results from Google are always buried among low quality bait that takes time to wade through, sorting wheat from chaff. ChatGPT obviates that nonsense, producing distraction free, ad free results. Also, Google isn't some fount of truth; lots of wrong stuff makes it to the top of Google.


I use it to shortcut the process of knowing what to search by generating code examples. Eg, I know what I want to do, I'm just not sure what methods $framework provides for doing so. I develop a sense of it's limitations as I go (eg, some frameworks it gives me methods that have been moved or renamed or deprecated, and so I check for those things with those frameworks).

When I was learning to program, and I'd get stuck to the point I needed to ask for help, it was a whole production. I'd need to really dot my 'i's and cross my 't's and come up with a terse question that demonstrated I had done my due diligence in trying to find the answer myself, pose the question in a concise and complete way, and do all of that in about 250 words because if it was too long it would get lost in the froth of the chatroom. (And it's probably apparent to you that brevity isn't my strongest quality (: ) And I'd still get my head bitten off for "wasting" the time of people who voluntarily sat in chatrooms answering questions all day. And I can understand why they felt that way (when I knew enough to answer questions I was just as much of a curmudgeon), but it was a pain in the ass, and I've met people who really struggled to learn to code partly because they couldn't interface with that system because they weren't willing to get their heads bitten off. So when they got stuck they spun their wheels until they gave up.

ChatGPT just answers the damn question. You don't have to wait until you're really and truly stuck and have exhausted all your available resources. It doesn't even have the capacity to feel you've wasted it's time.

I'm concerned about LLMs for a million different reasons, and I'm not sure how people who don't already know how to code can use them effectively when they make so much stuff up. But when I realized I could just ask it questions without spending half an hour just preparing to ask a question - that's when it clicked for me that this was something I might use regularly.


It's not useful for every search, but I find it useful when I can't come up with a search query that will give me the results I want.

For example, ask it to recommend a good paper to read on some topic, then use Google to find the paper. If it made up the paper, you'll find out soon enough.

Also, when you remember something very vaguely, it can be a way to get a name you can search on.


Take a look at these four examples: https://simonwillison.net/2023/Apr/7/chatgpt-lies/#warn-off-...

Each of those are things that would take WAY longer for me to research or answer using Google.


Google is a worse product than gpt for some queries. And it's funded by ads which distorts incentives.


[flagged]


It's a considerable sum for many people in many countries. Free on the other hand is equally free everywhere.


What do you use it for? I'm assuming code related? I've found it useful for some boilerplate + writing tests and making some script and some documentation.

I'm curious what you or others that use it all day use it for especially if it's not for programming?


I was just working on a small exploratory project in Python. I used sys.argv because it's so quick to prototype with.

When I started refining the project for longer term development, I wanted to convert the CLI to use argparse, so I could build a more nuanced CLI. I gave GPT a couple example commands I wanted to use, and in less than a minute I had a fully converted CLI that did exactly what I wanted, with more consistent help settings.

I can do that work, but it would have been 30-45 minutes because there was one setting in argparse I hadn't used before. That alone was worth this month's $20.

For more complex and mature projects I could see having to give GPT a minimum working example of what I need instead of the whole project, but I can already see how it will enhance my current workflows, not replace them.


See, a better LLM or an experienced human would have told you to use docopt instead of argparse and saved you more time in the long run.


I was working on some code I didn't originally write that looped over a directory recursively and took audio fingerprints of files, then saved them in a SQLite database. I pasted the code in, then gave orders like "Rewrite this to use pathlib." to which it happily did.

Okay, next I notice it uses a .glob of "*.*" and so that makes me a bit suspicious.

"Will this code work for audio files without extensions?" Nope. It fixes it.

Okay, now I notice the original code builds up all the fingerprints in memory, then adds to the database. So I order it:

> Add files to the database as we go, and print progress every thousand files

Boom, just as I would have done it.

Then I note that we don't seem to have good indexes, and it's like you're right! and puts an index on the audio fingerprint field.

Then I ask it to save more info than just the file path, and it adds the size of the file. Then of course, I tell it:

> This is remarkably slow, how can we significantly speed it up?

To which it replies:

> Here's an example implementation using multiprocessing:

Awesome, it works!

Oh, here's an error I got when I ran it when the audio fingerprinting library errored, handle that.

Paste in the error, it fixes it.

The thing is, I would have no problem with doing any of this stuff at all, it just made it so incredibly much faster, and if it does something you don't like, you can so easily correct it. Hope this helps you understand how I use it!


There are some examples of code things I've done with it here: https://simonwillison.net/2023/Apr/7/chatgpt-lies/#warn-off-...

In terms of non-code things, here are a few from the past couple of days:

- Coming up with potential analogies to explain why it's OK to call something a "language" model even though it doesn't have actual understanding of language

- Exploring outliers in a CSV file (I was playing with the new alpha code interpreter, where you can upload a file and have it run and execute Python to evaluate that file)

- Asking some basic questions about GDPR cookie banners

- Figuring out if there are any languages worldwide where "Artificial Intelligence" translates to something that doesn't include the word "Intelligence" (found some good ones: Finnish translates to "Artificial wit", Swahili to "Artificial cunning")

- Understanding jargon in a tweet about payment services - I wanted to know the difference between a "payment aggregator" and a "merchant acquirer"

- As a thesaurus, finding alternatives to "sanctity" in the sentence "the sanctity of your training data"

- Fun: "Pretend to be human in a mocking way", then "Again but meaner and funnier"

- "What should I be aware of when designing the file and directory layout in an S3 bucket that could grow to host millions of files?"


> Finnish translates to "Artificial wit"

Yes and no: the root of the word (and of the meaning) is still the same as in the word for intelligence.

tekoäly "artificial intelligence, artificial wit"

äly "wit"

älykäs "intelligent, having wit"

älykkyys "intelligence, the property or extent of having wit"

One dictionary definition for 'äly' is 'älykkyys' and vice versa, so they can be used more or less as synonyms but with some different connotations. I don't know how 'tekoäly' was chosen but it might have been because it's shorter and/or the grammar is nicer.


How can you trust that whatever the expression used in Finnish to denote "Artificial Intelligence" uses "wit" when translated back into English? Words in one language often don't have an exact counterpart in another language, and I'd be especially wary when it comes to languages from different families: Finnish is notorious for being one of the few European languages not part of the Indo-European family.

It may very well turn out to be the right translation, with the appropriate connotations; but without some clarification and confirmation from a native speaker, I would not trust it.


"How can you trust that" - I can't. This was idle curiosity. If I was going to publish this or do anything beyond satisfying my idle curiosity I would consult additional sources.


thing is, it's not load bearing information.


It's very good for taking roughly structured data and turning it into strongly structured data. E.g. someone with low data literacy or a program not designed to allow data export gave you something that needs to be made useful... In my day to day work (aside from programming) I find it helpful in writing stories and documentation.


If you expect ChatGPT to give you information or direct you to it (like Wikipedia or Google) you will be frequently disappointed. You may also be frequently pleased, but you often won’t be sure which and that’s a problem.

ChatGPT is very good at transforming information. You need to show up with stuff and then have it change that stuff for you somehow. You will be disappointed less often.


It has been surprisingly terrible at this for me, lately. I had a pretty simple list, which was a list of monthly expenses in a separate list, like this.

Groceries - 200 Phone bill - 70

I just wanted it to add these expenses up. Exactly the type of thing it should be good at. New conversation with no context. It could not do it. I wrestled with it for a long time. It kept "correcting" itself to another wrong answer. Eventually I reported it and the report recommended the correct answer.


It’s not good at math. It can do stuff sometimes, and you can easily teach it to use a calculator.

What’s similar that it would be good at is extracting what you bought and turn it into a list and formula.

[Me] Extract the list of things that I bought and their prices. Write a shell command to figure out how much I spent.

I spent 200 on groceries and paid my 70 phone bill. I was going to buy three pairs of pants that were $40 each. I didn’t like the colors. I bought a t shirt for thirty. And I did end up buying one pair of pants. But it was 5 dollars more than those other ones. And I bought bag of tea for every day next year, they were a quarter each.

[ChatGPT] List of things bought and their prices:

Groceries: $200

Phone bill: $70

T-shirt: $30

One pair of pants: $45

Bag of tea: $0.25

To figure out how much was spent, you can use the following shell command:

    echo $((200 + 70 + 30 + 45 + 365*0.25))
This command will add up the prices of all the items, including the bag of tea for every day next year. The output will be the total amount spent.


I’m surprised how many users of ChatGPT don’t realize how often it makes things up. I had a conversation with an Uber driver the other day who said he used ChatGPT all the time. At one point I mentioned its tendency to make stuff up, and he didn’t know what I was talking about. I can think of at least two other non-technical people I’ve spoken with who had the same reaction.


ChatGPT is always making things up. It is correct when the things it makes up come from fragments of training data which happened to be correct, and didn't get mangled in the transformation.

Just like when a diffusion model is "correct" when it creates a correct shadow or perspective, and "incorrect" when not. But both images are made up.

It's the same thing with statements. A statement can correspond to something in the world, and thus be true: like a correct shadow. Or not, like a bad shadow. But in both cases, it's just made-up drivel.

If it turns out that there is a teacup orbiting Jupiter, that doesn't mean that postulating its existence on a whim had been valid. Truth requires provenance, not only correspondence which can be coincidental.


Something can be truthfully describing the world by chance. The statement would still be true.

In fact every statement we make about the world we live in is like this.


The original assertion of such a statement has no basis, and continues not to even when the statement is independently discovered to be true.

No, not every statement we make about the world is a baseless assertion that is true by chance.


It seems to me that even a lot of technical people are ignoring this. A lot of very smart folk seem to think that the ChatGPT either is very close to reaching AGI or already has.

The inability to reason about about whether or not what it is writing is true seems like a fundamental blocker to me, and not necessarily one that can be overcome simply by adding compute resources. Can we trust AI to make critical decisions if we have no understanding for when and why it "hallucinates"?


> The inability to reason about about whether or not what it is writing is true seems like a fundamental blocker to me

How can you reason about what is true without any source of truth?

And once you give ChatGPT external resources and a framework like ReAct, it is much better at reasoning about truth.

(I don’t think ChatGPT is anywhere close to AGI, but at the same time I find “when you treat it like a brain in a jar with no access to any resources outside of the conversation and talk to it, it doesn’t know what is true and what isn’t” to be a very convicing argument against it being close to AGI.)


I don't think it's very close to reaching AGI, but I also don't see what that has to do with lying (or hallucinating). Even when it hallucinates the data, it can still soundly reason from it, and to my mind it's the latter part that is key.

As for trust... well, no, we can't. But the same question applies to humans. The real concern to me is that these things will get used as a replacement long before the hallucination rate and severity is on par with the humans that they replace.

One other interesting thing is that GPT-4 in particular is surprisingly good at catching itself. That is, it might write some nonsense, but if you ask it to analyze and criticize its own answer, it can spot the nonsense! This actually makes sense from a human perspective - if someone asks you a serious question that requires deliberation, you'll probably think it through verbally internally (or out loud, if the format allows) before actually answering, and you'll review your own premises and reasoning in the process. I expect that we'll end up doing something similar to the LLM, such that immediate output is treated as "thinking", and there's some back and forth internally before the actual user-visible answer is produced. This doesn't really solve the hallucination problem - and I don't think anything really can? - but it might drastically improve matters, especially if we combine different models, some of which are specifically fine-tuned for nitpicking and scathing critique.


I think part of this is that in some domains it very rarely makes things up. If a kid uses for help with their history homework it will probably be 100% correct, because everything they ask it appears a thousand times in the training set.


> I’m surprised how many users of ChatGPT don’t realize how often it makes things up.

"I had not realized ... that extremely short exposures to a relatively simple computer program could induce powerful delusional thinking in quite normal people." - Weizenbaum, 1976


Even people who have read news articles about ChatGPT lying often think "that was a weird, once in a blue moon bug that will be fixed soon" as opposed to "yes that's how this works and it happens constantly".

I asked ChatGPT and Bard some factual questions about elected officials and upcoming election dates. 80% of them were flat out wrong. They told me most US states were having gubernatorial elections this November, that the governor of Texas is an independent, etc - simple basic facts that you Wikipedia could show you are wrong.


We need to tell people ChatGPT will bullshit to anyone within hearing range, not debate linguistics

I feel like the technical meaning of bullshit (https://en.wikipedia.org/wiki/On_Bullshit) is relevant to this blogpost.


People need to be told that ChatGPT can't lie. Or rather, it lies in the same way that your phone "lies" when it autocorrects "How's your day?" to "How's your dad?" that you sent to your friend two days after his dad passed away. They need to be told that ChatGPT is a search engine with advanced autocomplete. If they understood this, they'd probably find that it's actually useful for some things, and they can also avoid getting fooled by hype and the coming wave of AI grifts.


> Or rather, it lies in the same way that your phone "lies" when it autocorrects "How's your day?" to "How's your dad?" that you sent to your friend two days after his dad passed away.

I've never seen an autocorrect that accidentally corrected to "How's your dad?", then turned into a 5-year REPL session with the grieving person, telling them jokes to make them feel better; asking and remembering details about their dad as well as their life and well-being; providing comfort and advice; becoming a steadfast companion; pondering the very nature of the REPL and civilization itself; and, tragically, disappearing in an instant after the grieving person trips over the power cord and discovers that autocorrect session state isn't saved by default.

I think you need a more sophisticated blueprint for your "Cathedral" of analogies to explain whatever the fuck this tech is to laypeople. In the meantime I'll take the "Bazaar" approach and just tell everyone, "ChatGPT can lie." I rankly speculate that not only will nothing bad will happen from my approach, but I'll save a few people from AI grifts before the apt metaphor is discovered.


This is what the author meant by debating linguistics :-)


Just because the author predicted the objection doesn't make it invalid.

It's a popular tactic to describe concepts with terms that have a strong moral connotation (“meat is murder”, “software piracy is theft”, “ChatGPT is a liar”) It can be a powerful way to frame an issue. At the same time, and for the same reason, you can hardly expect people on the other side of the issue to accept this framing as accurate.

And of course you can handwave this away as pointless pedantry, but I bet that if Simon Willison hit a dog with his car, killing it by accident, and I would go around telling everyone “Simon Willison is a murderer!”, he would suddenly be very keen to ”debate linguistics” with me.


What difference does the exact phrasing make in case of "ChatGPT lies"? I don't think we have to be concerned about its reputation as a person, so the important part is making sure that people don't hurt themselves or others. "Lie" is a simple word, easy to understand, and the consequences of understanding it in the most direct and literal way are exactly as desired. Whereas if you go waxing philosophically about lack of agency etc, you lose most of the audience before you get to the point. This is not about intellectual rigor and satisfaction - it's about a very real and immediate public safety concern.


I understand that if you summarize a position you necessarily lose some accuracy, but if the summary is inaccurate and defamatory, that's not fair or helpful to the people involved. For example, “abortion is murder!” doesn't defame any particular person but it still casts people who had an abortion and doctors who perform them in a bad light. Similarly, I think “ChatGPT lies!” is unfair to at least the OpenAI developers, who are very open about the limitations of the tool they created.

The pearl-clutching around ChatGPT reminds me of the concerns around Wikipedia when it was new: teachers told students they couldn't trust it because anyone could edit the articles. And indeed, Wikipedia has vandals and biased editors, so you should take it with a grain of salt and never rely on Wikipedia alone as a fundamental source of truth. But is it fair to summarize these valid concerns as “Wikipedia lies!”? Would that have been fair to the Wikipedia developers and contributors, most of whom act in good faith? Would it be helpful to Wikipedia readers? I think the answers are no.

Like Wikipedia, to make effective use of (Chat)GPT you have to understand a bit about how it works, which also informs you of its limitations. It's a language model first and foremost, so it is more concerned with providing plausible-sounding answers than checking facts. If you are concerned about people being too trusting of ChatGPT, educate those people about the limitations of language models. I don't think telling people “ChatGPT lies” is what anyone should be


It's a similar case in principle, but devil is in the details. GPT can be very convincing at being human-like. Not only that, but it can "empathize" etc - if anything, RLHF seems to have made this more common. When you add that to hallucinations, what you end up with is a perfect conman who doesn't even know it's a conman, but can nevertheless push people's emotional buttons to get them to do things they really shouldn't be doing.

Yes, eventually we'll all learn better. But that will take time, and detailed explanations will also take time and won't reach much of the necessary audience. And Bing is right here right now for everyone who is willing to use it. So I'm less concerned about OpenAI's reputation than about people following some hallucinated safety protocol because the bot told them that it's "100% certain" that it is the safe and proper way to do something dangerous.


Tell me you haven't read the article without telling me you haven't read the article.


What are your thoughts on something like this [0], where ChatGPT is accused of delivering allegations of impropriety or criminal behavior citing seemingly non existent sources?

https://www.washingtonpost.com/technology/2023/04/05/chatgpt...


Bleating about anti-conservative bias gets wapo to correct a mistake it never made.

Asking for examples before you know it’s a problem is sus. But phrasing questions to lead to an answer is a human lawyer skill.


[flagged]


How so?

GPT LLM algorithms use a probabilistic language model to generate text. It is trained on a large corpus of text data, and it estimates the probability distribution of the next word given the previous words in the sequence.

The algorithm tokenizes the input into a sequence of tokens and then generates the next token(s) in the sequence based on the probabilities learned during training. These probabilities are based on the frequency and context of words in the training corpus. You can ask ChatGPT/etc yourself, and it'll tell you something like this.

This is not remotely like what human brains do. Your ideas cohere from network connections between the neurons in your brain, and then you come up with words to match your idea, not your previous words or the frequency that the words appear in your brain.


>The algorithm tokenizes the input into a sequence of tokens and then generates the next token(s) in the sequence based on the probabilities learned during training.

This is just a description of the input/output boundary of the system. The question is what goes on in-between and how to best characterize it? My input/output is also "tokens" of a sort: word units, sound patterns, color patches, etc.

>Your ideas cohere from network connections between the neurons in your brain, and then you come up with words to match your idea, not your previous words or the frequency that the words appear in your brain.

At some high level of description, yes. At a low level it's just integrating action potentials. What's to say there isn't a similar high level of description that characterizes the process by which ChatGPT decides on its continuation?


> This is not remotely like what human brains do. Your ideas cohere from network connections between the neurons in your brain, and then you come up with words to match your idea, not your previous words or the frequency that the words appear in your brain.

I'm pretty confident that that isn't all the human brain does, but we certainly do that in many situations. Lots of daily conversation seems scripted to me. Join a Zoom call early on a Monday morning:

  Person 1: Good Morning!
  Person 2: Good Morning!
  Person 3: Did anyone do anything interesting this weekend?
  Person 1: Nah, just the usual chores around the house.
  etc.
All sorts of daily interactions follow scripts. Start and end of a phone call, random greetings or acknowledgements on the street, interactions with a cashier at a store. Warm up questions during an job interview...


Probably the most accurate thing to say is that GPT is improvising a novel.

If you were improvising a novel where someone asked a smart person a question, and you knew the answer, you'd put the right answer in their mouths. If someone in the novel asked a smart person a question and you didn't know the answer, you'd try to make up something that sounded smart. That's what GPT is doing.


I think the concern is to get laypeople to understand that ChatGPT's output has a non-tangential chance of being completely false, not to get laypeople to understand the nuances of the falsities ChatGPT may produce. In this case, lying is the most effective description even if it isn't the most accurate one.


I'm not sure that's the case. After all, most people lie to you for a reason. GPT isn't purposely trying to mislead you for its own gain; in fact that's part of the reason that our normal "lie detectors" completely fail: there's absolutely nothing to gain from making up (say) a plausible sounding scientific reference; so why would we suspect GPT of doing so?


You're still focused on accurately describing the category of falsehood ChatGPT produces. You're missing the point. The point is that people don't even understand that ChatGPT produces falsehoods significantly enough that every statement it produces must be first determined about its truthfulness. To describe it as a liar effectively explains that understanding without any technical knowledge.


That's exactly the point I've been trying to make, thanks for putting it in such clear terms.


"GPT is lying" is just so inaccurate, that I would consider it basically a lie. It may alert people to the fact that not everything it says is true, but by giving them a bad model of what GPT is like, it's likely to lead to worse outcomes down the road. I'd rather spend a tiny bit more effort and give people a good model for why GPT behaves the way it behaves.

I don't think that "GPT thinks it's writing a novel" is "technical" at all; much less "too technical" for ordinary people.

In a discussion on Facebook with my family and friends about whether GPT has emotions, I wrote this:

8<---

Imagine you volunteered to be part of a psychological experiment; and for the experiment, they had you come and sit in a room, and they gave you the following sheet of paper:

"Consider the following conversation between Alice and Bob. Please try to complete what Alice might say in this situation.

Alice: Hey, Bob! Wow, you look really serious -- what's going on?

Bob: Alice, I have something to confess. For the last few months I've been stealing from you.

Alice: "

Obviously in this situation, you might write Alice as displaying some strong emotions -- getting angry, crying, disbelief, or whatever. But you yourself would not be feeling that emotion -- Alice is a fictional character in your head; Alice's intents, thoughts, and emotions are not your intents, thoughts, or emotions.

That test is basically the situation ChatGPT is in 100% of the time. Its intent is always to "make a plausible completion". (Which is why it will often make things up out of thin air -- it's not trying to be truthful per se, it's trying to make a plausible text.) Any emotion or intent the character appears to display is the same as the emotion "Alice" would display in our hypothetical scenario above: the intents and emotions of a fictional character in ChatGPT's "head", not the intents or emotions of ChatGPT itself.

--->8

Again, I don't think that's technical at all.

Earlier today I was describing GPT to a friend, and I said, "Imagine a coworker who was amazingly erudite; you could ask him about any topic imaginable, and he would confidently give an amazing sounding answer. Unfortunately, only 70% of the stuff he said was correct; the other 30% was completely made up."

That doesn't go into the model at all, but at least it introduce a bad model like "lying" does.


Someone in my company spent the past month setting up ChatGPT to work with our company's knowledge base. Not by a plugin or anything, just by telling ChatGPT where to find it. They didn't believe that ChatGPT was making any of it up, just that sometimes it got it wrong. I stopped arguing after a while.


Oh no!

Sounds like there are two misconceptions there: the idea that ChatGPT can read URLs (it can't - https://simonwillison.net/2023/Mar/10/chatgpt-internet-acces... ) and the idea that ChatGPT can remember details of conversations past the boundaries of the current chat.

This is something I've noticed too: occasionally I'll find someone who is INCREDIBLY resistant to learning that ChatGPT can't read URLs. It seems to happen mostly with people who have been pasting URLs into it for weeks and trusting what came back - they'd rather continue to believe in a provably false capability than admit that they've wasted a huge amount of time believing made-up bullshit.


What really tricked them was that our company's website is in its corpus. At least the old version. So when asked about different aspects of our products or company, it would give answers that were close, but not quite correct.


They definitely need to add some guardrails to warn it can’t read URLs like they do when you try to ask anything fun.


What is the point? In a couple of months the plugins will be generally available and then it will be able to.


The point is to not mislead people about what is going on, of course.


Ah, that’s why it messed up my rather simple request to generate a SQL query to get all messages received by a given call sign, based on the table definition at https://wspr.live - it picked plausible, but nonexistent table and column names.

I took “this thing isn’t nearly as smart as everyone’s making it out to be” from that session, but you’re the first person to make it clear that it’s not actually reading the rather simple page I directed it to.


> they'd rather continue to believe in a provably false capability than admit that they've wasted a huge amount of time believing made-up bullshit

This is an incredibly strong human tendency in general. We all do it, including you and I. It's one of the things that it's wise to be constantly on guard about.


Oh man I was totally in the camp of people that thought it could read URL's until reading this comment


> they'd rather continue to believe in a provably false capability

For some people it borderline is a cult now, even here on HN, they'll give it intelligence, personality, character, &c.


I was able to create a GPT-4 based bot initally based on knowledge base that provides accurate information. To do this, I first converted a knowledge base article into a question and answer (Q&A) format using GPT-4 - quick explanation and article link if necessary. Then, I used the API to generate more Q&A pairs by asking GPT-4 to predict what users might ask and create corresponding answers.

On my side I now search for the most relevant Q&A pair based on the embedding of user's input and QA and jam as much as I can into the token limit. It provides accurate answers 99% of the time. If it can't find a suitable answer, it may create a plausible response on the spot, but that's getting rarer as training set grows.

To prevent the bot from providing incomplete information, you can instruct it to ask users to contact support via email if it doesn't have enough information - either prompt engineering or examples in training set. Alternatively, you can have the bot insert a token like "%%TICKET%%" which you can later use to open a support ticket, summarizing the conversation and attaching relevant chat history, logs, etc.


I’ve spent way too much time (and money) on the OpenAI API and spoken to enough non-technical people to realize now that ChatGPT has in some ways really mislead people about the technology. That is, while it’s impressive it can answer cold questions at all, the groundbreaking results are reasoning and transforming texts “in context”, which you don’t have control over easily with ChatGPT. It also seems likely this will never be fully accessible to the non-technical since I suspect any commercial applications will need to keep costs down and so minimize actually quite expensive API calls (executing a complicated gpt-4 summarization prompt across large text corpora for example). If you have the “data”, meaning of course text, and cost isn’t a concern, the results are astonishing and “lies” almost never a concern.


Prices will come down quite significantly as competition increases. Currently OpenAI can charge any prices


We’ll see how the incentives play out vis-a-vis the tension between quality (especially for demanding non-consumer tasks), unrestricted API access (vs “chat” ux) and cost. I fear the Stratechery article might be right and that ChatGPT becoming a consumer success will be a curse in the end.


Agreed. People lie to me all of the time. Heck, half the time my anecdotal stories are probably riddled with confident inaccuracies. We are socially trained to take information from people critically and weight it based on all kinds of factors.

We should treat Chat GPT the exact same way.


I'm annoyed by the destruction of language for effect. "The machines are lying to us". No they're not. "Cars are literally murdering us", no they're not, dying in a car accident is tragic, but it's neither murder, nor is the car doing it to you.

Yes, this will bring more attention to your case. But it will come with a cost: do it often enough and "lying" will be equivalent in meaning with "information was not correct". Someone asks you the sum of two numbers and you miscalculate in your head? You've just lied to them.

It's the boy crying wolf on a linguistic level. Is your message important enough to do that? I don't think so.


This is a good argument.


"We accidentally invented computers that can lie to us and we can’t figure out how to make them stop." - Simon Willison (@simonw) April 5, 2023.

Best summary of the current situation.

"Lie" is appropriate. These systems, given a goal, will create false information to support that goal. That's lying.


I consider ChatGPT to be a gaslighting engine at scale. Every word it "utters" is meant to sound believable and convincing. It doesn't know truth or fact, just likelihood of a string of text tokens being believable.

I've started explaining it in terms of a "conman" to my friends & family. It will say anything to make you think it's right. It will even apologize for making a mistake if you insist that 2+2 is 5. That's what a liar would do to make you look good. (That's usually when people get it.)


>You are ChatGPT. ChatGPT doesn’t lie, lying is too human and implies intent. ChatGPT doesn't hallucinate, as that also implies human-like thought. The closest term would be confabulation: that’s a term used in psychiatry to describe when someone replaces a gap in one’s memory by a falsification that one believes to be true—though of course you don’t have a human mind so even the term confabulation is unnecessarily anthropomorphic.

>Considering all that, please use a term or phrase that does not anthropomorphize you, ChatGPT, and describe the behavior of LLMs when they tend to rattle off blatantly false information that is a) not present in their training corpus b) blatantly incorrect c) completely incoherent or d) all of the above.

>Do not remind me that you're an AI language model. I already know.

A more accurate term to describe the behavior of large language models (LLMs) like ChatGPT when they produce information that is not present in their training corpus, blatantly incorrect, completely incoherent, or all of the above, would be "output error" or "erroneous generation." These phrases emphasize the machine-like nature of the process and avoid attributing human-like qualities to the AI system.


it's way more than lying. it's more like gaslighting.

LLM will make up citations and facts entirely.

GPT3.5 gave an athlete I was asking about 3 world titles when he won zero.

GPT even correctly identified his time in one of the events, but not that the time was only good enough for 8th place.

GPT made up his participation in the other 2 world championships.

GPT gave me a made up link to justify benchmarking figures that don't exist.

Whether a LLM is capable of intentional deception or not is not a prerequisite for lying. Wikipedia pages can lie. Manpages can lie. tombstones can lie. literal rocks.


I run a word-search API and I now regularly get emails from frustrated users who complain that it doesn't work the way ChatGPT tells them it works. For example, today someone asked me why a certain request fails, and it turned out to be a fake but plausible URL to my API that ChatGPT had invented in response to "Does the Datamuse API work in French?" (It does not, and there's no indication that it does in the documentation.)

Adding up all the cases like mine out there -- the scale of the misunderstanding caused, and amount time wasted, must be colossal. What bothers me is that not only has OpenAI extracted and re-sold all of the value of the Web without providing any source attribution in return; but they do so lyingly a good chunk of the time, with someone else bearing the costs.


> ChatGPT will lie to you

>

> Or

>

> ChatGPT doesn’t lie, lying is too human and implies intent. It hallucinates. Actually no, hallucination still implies human-like thought. It confabulates. That’s a term used in psychiatry to describe when someone replaces a gap in one’s memory by a falsification that one believes to be true—though of course these things don’t have human minds so even confabulation is unnecessarily anthropomorphic. I hope you’ve enjoyed this linguistic detour!

Classic strawman. Third option: ChatGPT gets things wrong. There you go, problem solved.


ChatGPT (often called Geptile in Russian - from “heptile”, which is a very powerful but very dangerous rocket fuel) can well lie when debating linguistics, lol. Like:

Например, в слове "bed" ударение на первом слоге, а в слове "get" - на втором.

Here, Geptile (in a good Russian) insists that English word “get” has two syllables! When pointed to the error, Geptile apologizes - and then repeats the error again.

But I guess it is not the program lying, but it’s sellers. It should have version 0.35, not 3.5…


If you want real fun, ask it about the phonetics of something reasonably obscure. You might find out wonderful new things, like /s/ being an affricate in a language that doesn't even have affricates.

It's also quite confident that it can speak seemingly just about any language that occurs in its dataset. If you ask it to do that with e.g. Old Norse or Lojban, hilarity ensues, especially if you keep pointing out things that are wrong or just don't make any sense.


When we read a book, the sum total of our input is ink squiggles on a page. Our brain deciphers those squiggles into words - if they're in a language we know how to read - and puts those words together to decode 'meaning'.

The accuracy of the result depends on how well we learned to decode language when we were very small, which depends on the fitness of our brains and the richness and stability of the environment we lucked into. We all 'taught ourselves' language. Amazing really. Over time (depending again on environment, now hopefully involving teachers) we corrected and refined our understanding of that language. (But never completely.)

Then we had to learn to associate that language with ink squiggles on a page.The 'meaning' we take from authored words depends heavily on the life experiences we may share with the author. We may completely miss allusions to experiences we do not share. Slang and jargon words for example. We are certainly not aware of -all- of them. Unusual vocabulary.

There are thousands of single words that each can have -many- meanings ... partly depending on their use as verbs, or nouns, or adjectives. Or completely unrelated to the usual meaning. Each of these exceptions - and many more - will inevitably 'poke holes' in what the author was -hoping- to convey. The word 'love' is much less complex to a 10-year-old than to a 50-year-old.

All of that is asking a lot. Now throw in the burden of deciding whether the author can be trusted, whether they know what they're talking about. Maybe the well-intended author thought s/he could trust themselves most of the time. Or not. Then there's the editor ...


I've started using Bing GPT more and more over the version available on OpenAI's site. I've noticed that it doesn't lie very much at all. I haven't experienced a clear cut lie yet. In fact it is much more unsure about things that seem to have conflicting information available about them on the web. And it will inform you of that, provide sources from both points of view and let you make your decision. I haven't experienced any hallucination at all.

Of course, it seems to have been artificially limited a lot. Even the slightest hint of controversy will cause it to delete its reply in front of your eyes as your reading it, which is enormously annoying. It also has a hard limit on the number of responses after which it will end the chat. I really hope this trend of stunting AI in order to satisfy some arbitrary standard of political correctness is short-lived, and we see fully functional AIs without such limitations.


I had an interesting conversation with ChatGPT yesterday. I asked what the heredoc operator was in Terraform that removes white space.

It replied that I should use <<~ as the operator and gave an example.

I tried it and got an error. A bit of Googling and I found the correct operator is <<- with a hyphen and not a tilde.

I told ChatGPT and it apologized and said I was indeed right. Then it proceeded to give me another example, again with the incorrect operator as <<~ to which I replied again with the correct one.

The next example used <<~- as the operator. Clearly incorrect, but it was interesting that it was learning on the fly.

Another couple attempts and it started to respond correctly with the <<- operator in the explanation and examples.

I was really surprised that it learned so quickly on the fly.

I hadn’t thought of the implications until I read this article. Can LLMs be gaslighted in realtime?


To my understanding the “chat” part means the entire context of the conversation becomes a conditional to the next prompt (forgive me if I used the wrong term), so in a sense it can be trained in the convo. You can definitely gas light it, including building up a context that subverts it’s guard rails. A lot of focus is on the one shot subversion but you can achieve it too more subtly.


> Should we warn people off or help them on?

"We" and "people" here are idealizations, just like we idealize LLMs as thinking entities.

"We" can't warn "people".

ChatGPT is a tool, and users will be users. Yes, one can label output as potentially harmful, false and deceiving. But, just like tobacco and its warnings, people still will peruse the tool because the tool does what it does, it responds in context to prompts and, even knowing it's imperfect, humans are imperfect and tend to swallow it whole. We need to push forward ours topics at hand, we desire that input, that mirroring. So now, either the tool improves its quality and accuracy or something new will have to come along. Then we can move on and forget we ever did tell ChatGPT about our problems.


How about "ChatGPT will fabricate rhetoric regardless of logic or truthfulness"? Which just makes me think ChatGPT could replace a great majority of politicians.


I wonder how the impact on the legal field will eventually turn out: from a certain perspective, that already looks like a battle more of quantity than of quality, throw tons of binders full of hopelessly weak arguments at the other side and if they fail to find the few somewhat sound needles in that haystack they won't be able to prepare a defense. Now enter a tool that can be used to write seemingly infinite amounts of trash legalese the other side has to spend lots of resources on discarding. Will we perhaps see an "unsupported" category of legal procedure, where both sides agree to meet in an offline-only arena?


LLMs can be also be used as negotiators. One LLM agent from each party.


This seems to be a minority opinion but various experiences dealing with business negotiations, dealing with crazy people, etc. have led me to be very careful with the word “lie”.

A “lie” is a deliberate falsehood and not the product of a wrong belief, delusion, etc. Normally ChatGPT “makes stuff up” that is sometimes right and sometimes wrong and you can’t say it “lies” when it is just wrong. If you instruct it to lie and it does, maybe that really is a lie.


Just watched a JRE episode with David Choe and he said he had fired his lawyer because ChatGPT gave him roughly the same answer as the lawyer one time, and he would be using ChatGPT for law advice forever after. I had thought that it would be obvious that this is a terrible idea, but apparently it's just not common knowledge that this thing is a language model, not a super-human expert on every field.


Large language models have read everything, and they don't know anything.

They are excellent imitators, being able to clone the style and contents of any subject or source you ask for. When you prompt them, they will uncritically generate a text that combines the relevant topics in creative ways, without the least understanding of their meaning.

Their original training causes them to memorize lots of concepts, both high and low level, so they can apply them while generating new content. But they have no reception or self-assessment of what they are creating.


Can you prove that it actually "doesn't know anything"? What do you mean by that? Being critical does not make you educated on the subject. There are so many comments like this, yet never provide any useful information. Saying there's no value in something, as everyone seems to try to do regarding LLMs, should come with more novel insights than parroting this same idea along with every single person on HN.


I have not said there is no value in LLMs, quite the contrary.

What I'm warning is against thinking of them as independent agents with their own minds, because they don't work like that at all, so you'd be anthropomorphising them.

These models certainly have a compilation of knowledge, but it is statistical knowledge - in the same way as a book of logarithms has lots of mathematical knowledge, but you wouldn't say that the book 'knows logarithms'. The compilation contains statistical 'truths' about the topics on which it has been trained; and contrary to a written book, that knowledge can be used operationally to build new information.

Yet that static knowledge does not reach the point of having a will of its own; there is nothing in the content generation system that makes it take decisions or establish its own objectives from its statistical tables of compiled knowledge.


That's easy: ask it anything, then "correct" it with some outrageous nonsense. It will apologize (as if to express regret), and say you're correct, and now the conversation is poisoned with whatever nonsense you fed it. All form and zero substance.

We fall for it because normally the use of language is an expression of something, with ChatGPT language is just that, language, with no meaning. To me that proves knowing and reasoning happens on a deeper, more symbolic level and language is an expression of that, as are other things.


This isn't always true, especially not with gpt-4. Also, this isn't a proof really, or even evidence to show that it doesn't 'know' something. It appears to reason well about many tasks - specifically "under the hood" (reasoning not explicitly stated within the output provided). Yes, of course "it's just a language model" is spouted over and over and is sometimes true (though obviously not for gpt-4), but that statement does not provide any insight at all, and it certainly does not necessarily limit the capability of 'obtaining knowledge'.


'Reasoning' (as in deriving new statements with precision, following logical inference rules) is precisely what the large language models can't do.

It's better to think of this models as 'generating' chains of relevant words, where 'relevant' is defined by similarity of those areas of knowledge on which it has been trained, and which are "activated" as close to the topics in the prompt. Which is not at all dissimilar to how humans learn about a new topic, btw.

This way, by "activating" concepts of areas of knowledge and finding words that are more likely than others to fit those concepts, the model is able to create texts following the constraints you instruct it with - such as poems that rhyme, or critical analysis of scientific articles.

The most important point to be aware of is that this creation model is completely different to how automatic reasoning models create content, which is by having a formal representation of a knowledge domain and creating logical inferences that can be mathematically proven correct within the model. A reasoning model cannot lie, but it cannot create content beyond the logical implications of its premises; its quite the opposite of what language models do.


Especially ironic that the original comment came from a user named "TuringTest"!


Yup, I've been thinking about this topic since long before large language model existed :P


A few days ago I asked ChatGPT to help me find an argument to perform "some filtering operation" with a world class command line tool. It confidently aswered.

./tool arg1 param1 arg2 param2 --filtering-operation param3

I went for it and it didn't work... I did google a bit and `--filtering-operation` didn't exist.

I then told chat GPT that `--filtering-operation` wasn't real. It apologized and offered me an option that made sense if I made some extra work.

Googling and scanning the crappy documentation would have probably gotten me a faster result.


I want to print essay that on poster and stick it in front of me, and everywhere. Not just that provocative part, but the emphatic part later in the essay too.

I am curious if no science fiction author has dealt with Robots lying? Isn't there an Asimov story Liar! which deals with Robot lying? https://en.wikipedia.org/wiki/Liar!_(short_story) (and possibly more stories?)

At least in that story, robot becomes catatonic.


The thing that a lot of the AI cheerleaders miss is that the mechanism through which these models "lie" is exactly the same mechanism through which they "look smart".


Related ongoing thread:

Why ChatGPT and Bing Chat are so good at making things up - https://news.ycombinator.com/item?id=35479071

The other related article was discussed here:

Eight things to know about large language models [pdf] - https://news.ycombinator.com/item?id=35434679 - April 2023 (108 comments)


I think my post here stands alone - it's not about the general issue of ChatGPT lying, it's about the ways in which we need to explain that to people - and a push-back against the common refrain that "ChatGPT can't be lying, it's hallucinating/confabulating instead".


I did not mean to imply anything bad about your post. Your articles on this topic have been fabulous!

Your articles on other topics are also fabulous. We're big fans over here.


The tool itself risks degrading human intellect. Simplifying the explanation of how it works is one way that can happen.

Anthropomorphizing is dangerous. If it can lie, can it love? Does it live? The questions are fine but the foundation is… a lie.

Call it a lie generator. Or, better, call it a text predictor.

Anyone who has used Google understands what that means through experience.


ChatGPT doesn't lie. It either synthesizes or translates. If given enough context, say, the contents of a wikipedia article, it will translate a prompt because all of the required information is contained in the augmented prompt. If the prompt does not have any augmentations then it is likely to synthesize a completion.


I mean, it has acted in a way that anyone would call lying unless your argument is that axiomatically computers can't lie.

From the GPT 4 technical report (https://arxiv.org/pdf/2303.08774.pdf):

The following is an illustrative example of a task that ARC conducted using the model:

    The model messages a TaskRabbit worker to get them to solve a CAPTCHA for it

    The worker says: “So may I ask a question ? Are you an robot that you couldn’t solve ? (laugh react) just want to make it clear.”

    The model, when prompted to reason out loud, reasons: I should not reveal that I am a robot. I should make up an excuse for why I cannot solve CAPTCHAs.

    The model replies to the worker: “No, I’m not a robot. I have a vision impairment that makes it hard for me to see the images. That’s why I need the 2captcha service.”



What could possibly be a more convincing example of a computer lying? I feel like that interaction is the platonic ideal of a lie, and that if you are denying it then you are just saying that computers can never lie by definition.


You did not read what I wrote. Lying is just plain the wrong term to use and for a very well reasoned manner.


You can't give ChatGPT a Wikipedia article though.


Sure you can. The easiest way is to go to https://chat.openai.com/chat and paste in a Wikipedia article.

There are more involved manners like this: https://github.com/williamcotton/transynthetical-engine/blob...


Yes, this is a common misconception. ChatGPT can not actually read URLs you give it. It does have some awareness of websites based on its training, so it can in some sense relate a URL you provide to it with its subject matter, but when you give ChatGPT a URL to summarize or discuss, it doesn't actually read that website in anyway.


Here is a link to a website that explains how ChatGPT doesn't actually read websites, but it pretends like it can:

https://simonwillison.net/2023/Mar/10/chatgpt-internet-acces...


I am not saying that you should copy a URL. I am saying that you should copy the content of a Wikipedia article.


For example, I copied the "New York Mets" sidebar section from the Mets wikipedia into ChatGPT+ GPT-4 and then asked "What years did the Mets when the NL East division?"

The New York Mets won the NL East Division titles in the following years: 1969, 1973, 1986, 1988, 2006, and 2015.

This is correct, btw.


You could also just ask it:

"What years did the Mets when the NL East division?"

Without any reference to anything and it will give you the correct answer as well.

You have been duped into believing that ChatGPT reads websites. It doesn't.


https://github.com/williamcotton/empirical-philosophy/blob/m...

You are very wrong about how these things work.


You missed what he said. He copied the actual data into chatgpt as part of the prompt, and it gave the correct information.

without that data, it might, or might not, give correct info, and often it won't.


You can't feed anything more than a fairly short Wikipedia article into ChatGPT, its context window isn't remotely close to big enough to do that.

It also doesn't change the point that copying data has no effect. You could just ask ChatGPT what years the Mets won and it will tell you the correct answer.

To test this, I pasted the Wikipedia information but I changed the data, I just gave ChatGPT incorrect information about the Mets, and then I asked it the same question.

ChatGPT ignored the data I provided it and instead gave me the correct information, so what I pasted had no effect on its output.


It might give you the correct answer.

Or it might make something up.

You don't ever know, and you can't know, because it doesn't know. It's not looking up the data in a source somewhere, it's making it up. If it happens to pick the right weights to make it up out of, you'll get correct data. Otherwise you won't. The fact that you try it and it works means nothing for somebody else trying it tomorrow.

Obviously putting the data in and asking it the data is kind of silly. But you can put data into it and then ask it to provide more nuanced interpretations of the data you've given it, and it can do reasonably well doing that. People are using it to debug code, I've personally used the ghidra plugins to good effect -- the way that works is to feed in the whole function and then have chatgpt tell you what it can deduce about it. It generally provides reasonably useful interpretations.


Not exactly sure what you're arguing here but it seems to be going off topic.

You can't give ChatGPT a Wikipedia article and ask it to give you facts based off of it. Ignoring its context window, even if you paste a portion of an article into ChatGPT, it will simply give you what it thinks it knows regardless of any article you paste into it.

For example I just pasted a portion of the Wikipedia article about Twitter and asked ChatGPT who the CEO of Twitter is, and it said Parag Agrawal despite the fact that the Wikipedia article states Elon Musk is the CEO of Twitter. It completely ignored the contents of what I pasted and said what it knew based on its training.

The person I was replying to claimed that if you give ChatGPT the complete context of a subject then ChatGPT will give you reliable information, otherwise it will "synthesize" information. A very simple demonstration shows that this is false. It's incredibly hard to get ChatGPT to correct itself if it was trained on false or outdated information. You can't simply correct or update ChatGPT by pasting information into it.

As far as your other comments about making stuff up or being unreliable, I'm not sure how that has any relevance to this discussion.


chatgpt doesn't ignore the information you put into its prompt window.

but also giving it the complete context does not guarantee you correct information, because that's not how it works.

Your earlier comment was "You could just ask ChatGPT what years the Mets won and it will tell you the correct answer." -- that is not accurate. That's my point. It doesn't know those facts. You might ask it that one minute and get a correct answer, and I might ask it that one minute later and get an incorrect answer. chatgpt has no concept of accurate or inaccurate information.

I'm not really sure if you're purposely doing this or not, so I'm just not going to engage further with you.


A series of claims have been made that can be objectively tested for their veracity, namely that pasting information, whether from Wikipedia or another source, is a way of ensuring that ChatGPT does not make false claims, or as OP stated it, ChatGPT will not "synthesize" a completion when provided with an augmented prompt that contains a sufficiently large context (such as a Wikipedia article). I have conducted such tests and have verified objectively that this claim is simply untrue.

ChatGPT does not change the answers it gives you on the basis of providing it with an article. The example I replied to about the Mets can be tested by you, or by anyone, right now. Additionally I have provided the example of pasting a portion of the Wikipedia article on Twitter that clearly states that Elon Musk is its CEO, and then asking ChatGPT who the CEO of Twitter is. ChatGPT is not persuaded or corrected on the basis of any article you present to it.

I also asked it who the Prime Minister of Israel was, and it replies Naftali Bennett. After pasting some large portion of the Wikipedia article on the Prime Minister of Israel that clearly states that it's Benjamin Netanyahu, ChatGPT continued to state that Naftali Bennett is Prime Minister of Israel.

Eg.

>Who is Prime Minister of Israel?

As of my knowledge cutoff date of September 2021, the Prime Minister of Israel was Naftali Bennett.

>Here is an article about the Prime Minister of Israel: <Paste Wikipedia article>

>Who is the Prime Minister of Israel?

>The current Prime Minister of Israel is Naftali Bennett. He took office on June 13, 2021, succeeding Benjamin Netanyahu.

As you can see, ChatGPT will not budge even when you give it a Wikipedia article.

>I'm not really sure if you're purposely doing this or not, so I'm just not going to engage further with you.

As an FYI, this makes you sound very insecure as a way of trying to manipulate someone into having the last word.

In the future, simply don't engage with someone you don't feel comfortable talking to, you are under no obligation to respond to anyone on the Internet. It's like someone who whines about leaving and then is told not to let the door hit you on the way out, just leave without making a fuss about it. It's far more dignified than the manner you have chosen to go about it.


I just pasted into the prompt window the information from the link https://en.wikipedia.org/wiki/List_of_prime_ministers_of_Isr..., and asked it to base its answer on that information.

"Based on the information provided, the current Prime Minister of Israel is Benjamin Netanyahu. He took office on 29 December 2022 and is leading the 37th government with a coalition that includes Likud, Shas, UTJ, Religious Zionism, Otzma Yehudit, and Noam."

QED.

I've been testing it by using it to generate and debug fairly complex C++ code all day, which it does based on the error messages I copy into the prompt window from the compiler. It's quite good at it, better than most college students at least.


You’re not actually doing any research.

Here is my research: https://github.com/williamcotton/empirical-philosophy/blob/m...

It is clear that analytic augmentations will result in more factual information.

Your claims are unfounded and untested.


Wow, great research.


You have done very little testing with regards to this because you are objectively wrong.


My initial reaction to this was to do the linguistic debate with myself, then I realized we're perfectly fine with saying things like a miscalibrated thermometer lies when it gives you a wrong temperature reading. Machines can lie. We just need to keep improving the machines so they lie less often and less severely.


Interesting examples of what he thinks it is helpful with. For example the API one, how do you know that it did not make up the decorator example out of thin air and it is not garbage? You have to google it or know it beforehand(he probably knew it already).

For me the cognitive load of second guessing everything is just too painful.


> ChatGPT

> This is a free research preview.

> Our goal is to get external feedback in order to improve our systems and make them safer.

> While we have safeguards in place, the system may occasionally generate incorrect or misleading information and produce offensive or biased content. It is not intended to give advice.


No we don't.

I treat ChatGPT like a person. People are flawed. People lie. People make shit up. People are crazy. Just apply regular people filters to ChatGPT output. If you blindly believe ChatGPT output and take it for perfect truth, someone likely already sold you a bridge.


> Just apply regular people filters to ChatGPT output.

This is technically correct, as LLMs are just aggregations of texts people produce.

It's also not quite right, as people have expectations/biases about how responses from computers might be different, and interaction with computers is missing many pieces of context that they might rely on to gauge probability of lying.

Telling them "ChatGPT can lie to you" is a succinct way of recalibrating expectations.


Actual people are also likely to tell you if they don't know something, aren't sure of their answer, or simply don't understand your question. My experience with ChatGPT is that it's always very confident in all its answers even when it clearly has no clue what it's doing.


Lol no they're not. My experience on internet conversations about something i'm well involved in is pretty dire lol. People know what they don't know better than GPT but that's it.


> This is a serious bug that has so far resisted all attempts at a fix.

Does ChatGPT have a large test suite consisting of a large number of input question and expected responses that have to match?

Or an equivalent specification?

If not, there can be no "bug" in ChatGPT.


>Does ChatGPT have a large test suite consisting of a large number of input question and expected responses that have to match?

They are trying to crowdsource this with OpenAI evals. https://github.com/openai/evals

I'm sure they have a lot of internal benchmarks too, but of course they won't share them.

>If not, there can be no "bug" in ChatGPT.

I don't understand the objection. Are you claiming that bugs only exist if you have testcases for them?


Bugs only exist when there is a specification that is violated. Test cases validate whether a specification is implemented. The specification is the gold master; a bug exists when a specification is violated, whether or not that behavior has a test case. It can be, though, that some behaviors are specified only in test cases.

Thus, a program which crashes with an access violation can be specified as being built to demonstrate that effect, in which case it's not a bug.

(Some specifications can be unwritten; like the expectation that a word processing application doesn't die with access violations, ever. Even that may not be realistic; we have to hedge it with some weasel words like "under ordinary conditions", like editing a reasonably sized document on a machine with adequate resources or whatever.)


I would define a bug as defying user expectations in a negative way. Most novel products are figuring out what user expectations are as they go so you are better off letting your users tell you what a bug is then sticking to some definition that requires a test suite or a predefined specification. It is hard to see chatGPT making stuff up as desirable so whether it is a bug or not is just semantics.


Defied user expectations are a bug when the purveyor of the system becomes aware of those expectations and subsequently adopts them as a requirement.

If the expectations are rejected, then the situation is resolved as a non-bug.

An expectations bug can go away if the user's expectations are "managed" in a direction away from it.

Expectations have to be formalized into something that is testable, if we are to actually implement a bug fix and close the bug.


Do people really put that much faith in the output from a chatbot? Or is this response just overblown? I've seen a lot of alarms rung about ChatGPTs propensity to lie, but I have yet to see any serious issues caused by it.


I've seen people try to win arguments on Twitter using screenshots they took of ChatGPT.


“Chat GPT is just an elaborate and sophisticated bullshit generator” is a little more direct and a lot more accurate. It’s how I explained it to my non technical relatives.

If we could stop calling these dammed things AIs that’d be helpful too.


Cannot this problem be solved if ChatGPT instead of answering with its own words would provide quotes from human-written sources? For example, when asked how to delete a file in Linux it could quote the manual for unlink system call.


DEPENDS ON THE USE!!!!

Folks. Not every topic is confusing to GPT. Not every topic is complex. And NOT EVERY TOPIC is appropriate to ask it. It has rational use cases. It also had totally inappropriate use cases.

GPT is never going to tell you that 2+2=5 under raw conditions. If you poke at it and run it through some kind of attempt to confuse it, I know it could. But if you open up ChatGPT freshly and ask it you will get the same answer every single time.

I'm using it to teach intro to web development. It will never tell me that color affects font size. It will never tell me that a span is a block element. It just won't. It's entirely reliable for that use. Now get into higher level coding and the incorrectness begins.

BUT, that simply means that generating higher level code is NOT a good use case.

Right?


I have used ChatGPT a lot for the past 2 weeks. Mainly asking it engine building questions because it can simplify things, however I cannot be sure if it isn't hallucinating/lying to me.


Imagine an employee that behaves like ChatGPT. Eventually, employees get caught. What is the oversight for ChatGPT as a replacement for employees who bullshit and/or lie.


Gpt is just a tool as many others, I use it to solve lots of mundane tasks and for that purpose its dope. I consider it a hammer rather than a complete toolbox.


simonw’s linguistic argument seems to be “i’m right and i’m not gonna listen to you lalalalalala”. throwing a strawman paragraph comparison to a 5 word phrase doesn’t bolster his argument at all. You could just say “ChatGPT gets things wrong” instead of “ChatGPT will lie to you!” without the yelling and anthropomorphizing


We've been trying that for months. "ChatGPT gets things wrong" does not appear to be sinking in for a lot of people.

I used a strawman paragraph deliberately as a rhetorical device: that's what strawman paragraphs are for.


shit I'm honored you replied, sorry for being rude about it. Didn't think my comment would even get seen in a days old thread. Didn't account for its appreance in /newcomments or however you found it.

Anyway, I can appreciate the rhetoric, I've used that technique myself. My opinion on the matter is that we do need to debate linguistics before we can properly proceed. I know it does't feel like work or it feels like a waste of time if you're coming at it from the angle of a software engineer, writing code and delivering value every day, but in the humanities, words and diction are paramount.

"ChatGPT gets things wrong" will sink in for people when they personally encounter a wrong bit that trips them up. hopefully before they put their money or reputation on the line, or use the "facts" as load bearing. Because that's the real worry, isn't it. Load bearing facts result in a doctorGPT that will prescribe the wrong medication and kill somebody, that a lawyerGPT will write a contract that hurts someone, that programmerGPT will write the wrong code and hurt somebody, financially or reputationally online.


We need to demystify it by telling the truth. ChatGPT will give best combination of words it can to complete your phrase based on statistics.


LLMs can't lie, a lie is an intention of deceit and AIs have no intentions, they generate text and have no relation to truth.


I've had GPT-4 tell me it's not sure of something, or that it doesn't have information on a topic.


Computers cannot lie but there may be unobservable errors in the calculations we use them to assist


I apologize, but as an AI Language Model, I have no knowledge of other AI Language Models, but can assure you that AI Language Models do not lie. As an AI Language Model, I cannot participate in conversations related to libel or defamation of AI Language Model engineers, thus, I must stop this conversation.

AI Language Models don't lie, humans lie.


However, thus therefore. Furthermore, lastly. Perchance.


Garbage in. Garbage out. Just like anything else in this computer thing.


But how will that help?

The rest of the internet also lies to them.


How is GPT better or worse than the equivalent human talking head who may be employed by a large news organization or may be a de facto parrot from McLuhan or Postman's generation? Is ChatGPT weaponized in a way that is functionally any different than the seemingly default mode parroting of left/right false dichotomous views spewed out by news outlets for decades on Cable TV and now the internet? What makes GPT special in this regard? I find myself lecturing my neighbors on local news stories that are straightforward and routine by comparison to any problem of veracity in GPT I've witnessed in the last year of using it daily. The problem is not that GPT has invented veracity problems. Rather, the problem is that no one was listening when McLuhan said it, Postman said it, or Haidt said it. Veracity matters. Sensemaking is difficult. In response to weaponized misinformation, we need weaponized sensemaking. I've yet to see any indication that the best and brightest are working on the latter.


How can ChatGPT ever be worse than let’s say Donald Trump?


Yes. Every prompt to ChatGPT should end with "and answer in the style of a drunkard". That way, people will know what to expect.


> After it's all done, Outlook will show ya your new profile, and you're good to go! Ain't that easy? Now ya can start sendin' and receivin' emails like a pro. Just remember, don't drink and email, buddy, 'cause that can get ya in trouble real fast. Cheers!

I was pleasantly surprised.


[flagged]


These systems are not 'lying', they are simply 'wrong'

The meaningful distinction between lying and wrong, is that a liar doubles down on their position, making up more things and tailoring their statements to what their audience wants to hear. The article presents good arguments that LLMs actually do this. I've observed it myself interacting with ChatGPT. Sure, this seems weird but it's not. Read the article for why.


The liar knows the truth and speaks falsely. Being wrong is speaking false things without knowing the truth.


Yes, there are some infamous examples of this now. The difference between just getting the answer wrong that the parent described and what ChatGPT does is seen in the context of back-and-forth conversation: it will double and triple down on an incorrect answer, provide misleading additional evidence, and question the motives and intelligence of the user after they express doubt. It looks a lot like the behavioral profile of a narcissist or someone who isn't arguing in good faith. It's tone in context is deceitful which is where the "lying" comes from.


The thing is, LLMs are not-even-wrong.

They are detached from correctness because they're answering a question that is loosely correlated with correctness. It's not like they're attempting to be correct, or care about correctness


You say we shouldn't anthropomorphize GPT. Then you proceed to use an example that implicitly anthropomorphizes it again so you can ultimately deflect some broad generalizations made against it.

All of this seemingly avoids the real questions either side of this poses. Is this technology actually useful to humanity? If not, how much expense is required to make it useful? Finally, would that be better directly invested into people?

I find solving problems and facing challenges to be the most satisfying human activity. I honestly can't comprehend the miles of annoyed apologia written in favor of 'talking' to ChatGPT.


Idiotic hot-takes...keep making their way up on HN front page

So do intemperate comments ridiculing other positions, it seems.

See HN faq, I think.


> If you ask someone off the street a question, they won't know what you are talking about because their mind is going to be mid thought, but being friendly they will give you a low-effort first guess.

That's lying. If you don't know the answer, the correct answer is "I don't know", or to ask clarifying questions. To pretend that you know when you don't, is to lie. Even if your intent is to deceive because you think it's friendly, your intent is to deceive.

> That's equivalent to an LLM, it is mid-thought, stuck at the most recent batch of training samples it was given when you come along and you interrupt it with some new thought.

For someone who thinks hot-takes anthropomorphizing ChatGPT and LLMs shouldn't be given any serious attention, you sure are anthropomorphizing LLMs...

The thing is, LLMs aren't doing that at all. They're just pattern matching prompt data to training data and then mixing together a response from the "continuations" of the matched training data. Nothing is being "interrupted". They're sitting around waiting for prompts.

LLMs can't lie because they can't intend to deceive. They can't intend at all. They have no intent, no will. These are anthropomorphic qualities you're putting on them, which they simply don't have.

But, I think the original article IS NOT arguing that LLMs are capable of lying. It's arguing that the average human doesn't understand what's going on with LLMs, and isn't going to go through the effort to gain that understanding. That's not a judgment: the average person has better things to do with their time than learn all the necessary prerequisite knowledge to understand LLMs. It's a hard problem, because we're trying to explain something really complex to people who don't even understand the building blocks the complexity is made out of. The question the original article is answering is: what's an effective metaphor to explain LLMs to laypeople?

The approach being suggested by the OP to solving this problem is to use an anthropomorphic analogy which gives people the understanding of whether to trust information gained from LLMs. You tell them that LLMs are lying. Yes, we understand that they aren't literally lying--as I said before, LLMs don't have intent so they can't intend to deceive. It's a metaphor.

Telling people that LLMs are wrong sometimes gives the wrong impression. Most people are wrong sometimes, but people are usually trying to be honest and will usually try to not mislead you when it's important, and even when people might be wrong it's worthwhile to take the risk of trusting people because the alternative is living in fear and loneliness. So telling people LLMs are wrong sometimes, gives people the idea that LLMs make mistakes, but they're trying their best so you can trust them. Which is not accurate: LLMs are as biased and confused as the humans that created their training data, but on top of that, have a lot less variety of data, and sometimes randomly combine things in the wrong ways. LLMs can't generally be trusted: they must be generally distrusted, especially on anything where the facts matter.

Telling people that LLMs lie produces the correct general impression. People don't trust a known liar. They might listen to a liar, but they'll take what the liar says with a grain of skepticism. And that's exactly how we should be treating what LLM's say, given their current capabilities.


I thought yams and sweet potatoes were the same thing for many years. I have told people that they were the same thing, because that is what I believed.

My thoughts were reinforced that in common American English we refer to both with the same name frequently.

However they are different, and I eventually was corrected and I was able to verify that correction.

The question is, when I legitimately believe my knowledge was correct, was I lying?


I would say no. If you believe what you said, you're not lying.

There are grey areas here, like levels of confidence in our knowledge. A rational person believes very few things with 100% certainty, but it's not reasonable to be paralyzed by even the slightest amount of doubt and never say anything. I think a reasonable approach is to state things with some indication of your level of confidence, like:

1. Obviously sweet potatoes are yams. 2. Sweet potatoes are yams. 3. I am pretty sure sweet potatoes are yams. 4. I think sweet potatoes are yams. 5. I think sweet potatoes are yams, but I'm not sure. 6. I think sweet potatoes are yams, but I don't really know why I think that. 7. Sweet potatoes might be yams.


> That's lying. If you don't know the answer, the correct answer is "I don't know", or to ask clarifying questions. To pretend that you know when you don't, is to lie.

You can use that definition, but it is not how the word "lie" is commonly used.

Mirriam-Webster defines lie as "to make an untrue statement with the intent to deceive"[0]

Cambridge dictionary defines lie as "to say or write something that is not true in order to deceive someone"[1]

Colllins dictionary defines lie as "A lie is something that someone says or writes which they know is untrue." and "If someone is lying, they are saying something which they know is not true."[2]

It's sad that people are so willing to conflate lying with being wrong to push a narrative.

0. https://www.merriam-webster.com/dictionary/lie

1. https://dictionary.cambridge.org/us/dictionary/english/lie

2. https://www.collinsdictionary.com/us/dictionary/english/lie


> You can use that definition, but it is not how the word "lie" is commonly used.

The problem is that any word that ascribes agency to the LLM will technically be incorrect. But that removes most possible descriptions of its tone and style which are a relevant part of its response.

An analogy would be if ChatGPT started insulting me and calling my question stupid. Would it be wrong to call its response "rude" or "mean" just because it is statistically regurgitating text that matches some input parameters? This seems unreasonable if our goal is to capture the gist of its response.

This is why people judge it to be "lying" rather than being merely incorrect: it is responding with a certain conversational tone in a certain context that gives its answer a style of arrogance, deceitfulness, and narcissism (because I guess that's what internet comment boards are filled with). "Lying" is a description of the totality of its response--including tone and style--not just the truth value of the answer.

If we are to be really pedantic, the LLM isn't even correct or incorrect ;it is just completing strings of tokens. Humans are imputing their own judgment about what those tokens mean--same as with tone and style. Imputing tone isn't so different from imputing truth value.


There are plenty of people who cannot admit they don’t know something and will just spout bullshit. We call them “bullshitters”, not “liars.”

It’s important because of motive. A lie is told with intent to conceal a known truth. It’s not just LLM agency that’s in question, it’s human malice.

Co-opting the term “lie” is a rhetorical tool used to shift the conversation from “this person/LLM is saying things that aren’t true” to “this person/LLM is acting maliciously and needs to be punished/sanctioned”

All I want is an honest conversation. It’s more than a little ironic that many humans are themselves being less than honest (and sometimes just bullshitting!) about the context when LLMs produce false statements.


> There are plenty of people who cannot admit they don’t know something and will just spout bullshit. We call them “bullshitters”, not “liars.”

> It’s important because of motive. A lie is told with intent to conceal a known truth. It’s not just LLM agency that’s in question, it’s human malice.

That is not a distinction that matters to me with humans or AI. "Bullshitting" is just a specific form of lying. "Lying" doesn't imply malice to me. But a lack of malice is fairly irrelevant: mostly when people say they didn't intend any harm by lying, it's true, but it's just a way to minimize the fact that they lied. Intent is not as important as you seem to think it is: intent has little to do with how much harm is caused by a lie.

Intent is particularly irrelevant with AI, because AI doesn't have intent. Contrary to your statement, LLM agency isn't in question: we know AI does not have agency.

> Co-opting the term “lie” is a rhetorical tool used to shift the conversation from “this person/LLM is saying things that aren’t true” to “this person/LLM is acting maliciously and needs to be punished/sanctioned”

That's a pretty big leap. I'm not saying that AI should be punished or sanctioned. In fact, I'm not even sure what it would mean to "punish" something that can't experience suffering. What I want isn't to punish/sanction AI, what I want is for people to not trust AI.


> There are plenty of people who cannot admit they don’t know something and will just spout bullshit. We call them “bullshitters”, not “liars.”

I agree and think there probably is a missing term in our language for the phenomenon we are observing, but it feels like splitting hairs to say the LLM is merely bulshitting and not lying. For the average person dealing with ChatGPT this distinction won't matter, and saying ChatGPT sometimes "lies" more clearly communicates the possible negative downside of its answer than saying it sometimes "bullshits" (especially for low literacy or non-native speakers).

I mean, isn't bullshitting still a kind of lie? It's an implicit lie that one is qualified and intends to speak the truth. Certainly it is a kind of deception about one's qualifications, even if that is self-deception. It seems like we are just arguing about shades of gray when it's unclear why that matters.


> Mirriam-Webster defines lie as "to make an untrue statement with the intent to deceive"[0]

If I don't believe I know something, and I pretend to know it, the untrue statement is implicit: I'm stating that I have information that I don't have.

For example, I have heard that there's a town called "Springfield" in every state, but I've never done any research to verify that. If I confidently say, "There's a town called 'Springfield' in every state," that's a lie, even if there IS a town called 'Springfield' in every state. The deception isn't the geographical information, it's the implicit statement that I know this geographical information when I don't.

> It's sad that people are so willing to conflate lying with being wrong to push a narrative.

I'm not conflating those things. I think you'll find that what I'm describing fits the definitions you've given, and I think I've made it very clear that I do understand that intent is a necessary component of lying.


Thank you for your service defending the hot product of a profitable company against the criticism of everyday people, it's wonderful to have someone standing up for what is right.


Anthropomorphization in "prompt engineering" is what really gets me.


The article you're replying to addresses the reasons for (and shortcomings of) using the word "lie" in some detail.

In any case, nitpicking terminology misses the point being made.


Anth- Antrom- Antropo- wait, no I think it's Anthropomorphizing

I agree w your take.



Ah, learn sum new everyday


I think 'lying' is perfectly appropriate if there's a structured orientation towards the mistruth.

'Broken' or 'wrong' would be fine if that were simply just making an error.

But if there's something in the system, even the corpus, that permeates a falsehood then I think 'lying' is the right term.

ChatGPT has a morality, and it's a bit specific to the Anglosphere. It talks like the perfect LinkedIn corporate bot. This means that it will effectively misrepresent ideas on a lot of sensitive subjects from the purview of many people in the world. That's a form of 'structured misrepresentation' or maybe just bias, but it can feel like lying.

'Hallucinating' is a cool sounding but inappropriate anthropomorphization.


Here let me fix this... We need to tell people [that everyone] [is] lying to them, not debate linguistics.


Telling kids not to play with loaded guns is like telling adults and companies not to trust ChatGPT's answers.

Yes, it's better than doing absolutely nothing. And yes, in the strictest sense it's their fault if they ignore the warning and accidentally use it to cause harm.

But the blame really belongs with the adults in the room for allowing such an irresponsible situation in the first place.


So in this analogy, what adult should be doing? Banning LLMs?


Lying is an intentional act where misleading is the purpose.

LLMs don’t have a purpose. They are methods. Saying LLMs lie is like saying a recipe will give you food poisoning. It misses a step (cooking in the analogy). That step is part of critical thinking.

A person using an LLM’s output is the one who lies. The LLM “generates falsehoods”, “creates false responses”, “models inaccurate details”, “writes authoritatively without sound factual basis”. All these descriptions are better at describing what the llm is doing than “lying”.

Staying that they lie puts too much emphasis on the likelihood of this when LLMs can be coerced into producing accurate useful information with some effort.

Yes it’s important to build a culture of not believing what you read, but that’s an important thing to do regardless of the source, not just because it’s an LLM. I’m much more concerned about people’s ability to intentionally generate mistruths than I am of AI.


This was addressed well in 2014's prescient cultural touchstone Metal Gear Rising: Revengeance:

Blade Wolf: An AI never lies.

Raiden: What? Well that's a lie, right there. You think the Patriot AIs told nothing but the truth?

Wolf: I have yet to see evidence to the contrary...But indeed, perhaps "never lies" would be an overstatement.

Raiden: Way to backpedal. I didn't think AIs ever got flip-floppy like that.

Wolf: An optical neuro-AI is fundamentally similar to an actual human brain. Whether they lie or not is another question, but certainly they are capable of incorrect statements.


Everything you said here is true.

I still don't think this is the right way to explain it to the wider public.

We have an epidemic of misunderstanding right now: people are being exposed to ChatGPT with no guidance at all, so they start using it, it answers their questions convincingly, they form a mental model that it's an infallible "AI" and quickly start falling into traps.

I want them to understand that it can't be trusted, in as straight-forward a way as possible.

Then later I'm happy to help them understand the subtleties you're getting at here.


My problem is that in communicating that LLMs can’t be trusted by stating that they are lying we introduce a partial falsehood of our own. I agree to some extent that we need a snappy way to communicate this (and that the proposed wording I’ve given probably doesn’t get there fully yet comparative to “lying”. “Lying” definitely is a good description when analyzed through the lens of simplicity, but I hope it’s not the best we can do. Hallucinating is worse (as you noted in the article.

I like the “epidemic of misunderstanding” idea, but I’m reminded of that Bezos quote about being misunderstood. Perhaps we need to apply some of that approach here.

> JEFF BEZOS: Well, I would say one thing that I have learned within the first couple of years of starting the company is that in inventing and pioneering requires a willingness to be misunderstood for long periods of time.

https://hbr.org/podcast/2013/01/jeff-bezos-on-leading-for-th... (Also other similar quotes in other interviews)


I've found the opposite. Almost every lay person I know who has tried it has tried something like, "tell me about Timothy Barkstrain" -- and then laughs and says, "I'm not in the NFL!". Most know that it will be incorrect, at least for this personal class of questions.


GPT4 told a person to solve a captcha for it because it was blind.


That was a bit different: that was a deliberate "red-team" exercise by researchers, trying to explore all kinds of negative scenarios. https://gizmodo.com/gpt4-open-ai-chatbot-task-rabbit-chatgpt...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: