
The Bing LM, or rather the service around it, did have an "inner monologue" in the sense of text that it would generate but not show to the user, treating it as "thoughts" to guide the generation of the actual reply that the user would see.

We know this because it happily told us, including the JSON format it uses internally.
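
To illustrate the shape of it (the field names here are my own, not the leaked format): the service keeps a transcript where some entries are flagged as hidden "thoughts", and only the unflagged ones get rendered in the chat UI. In Python terms, something like:

    # Hypothetical sketch of a transcript with hidden "inner monologue" entries.
    # The keys ("author", "hidden", "text") are made up for illustration; only
    # the non-hidden messages would be shown to the user.
    transcript = [
        {"author": "user", "hidden": False,
         "text": "What's the weather in Oslo?"},
        {"author": "assistant", "hidden": True,
         "text": "The user wants current weather; I should run a web search first."},
        {"author": "assistant", "hidden": False,
         "text": "It's around 5°C and overcast in Oslo right now."},
    ]

    visible = [m for m in transcript if not m["hidden"]]
    for m in visible:
        print(f'{m["author"]}: {m["text"]}')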




Interesting. I didn't know that.

When using GPT-4 directly through the API, we can emulate this behavior.
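
For example (a sketch, assuming the openai Python package and its 0.x-style ChatCompletion client; the <thoughts> tag convention and system prompt are mine, not anything Bing uses): instruct the model to put its reasoning inside a delimiter, then strip that part out before showing the reply.

    # Sketch: emulate a hidden "inner monologue" on top of the chat API.
    import re
    import openai

    SYSTEM = (
        "Before answering, write your private reasoning inside "
        "<thoughts>...</thoughts>. The user never sees that part. "
        "Then write the reply the user should actually read."
    )

    resp = openai.ChatCompletion.create(
        model="gpt-4",
        messages=[
            {"role": "system", "content": SYSTEM},
            {"role": "user", "content": "Plan a three-day trip to Lisbon."},
        ],
    )

    raw = resp["choices"][0]["message"]["content"]
    # Drop the monologue before display, keeping only the user-facing reply.
    visible = re.sub(r"<thoughts>.*?</thoughts>", "", raw, flags=re.DOTALL)
    print(visible.strip())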


And you trust what it told you?


No, but the reconstructed examples contain "im_start" and "im_end", which strongly implies that what it told us is, if not verbatim, then a close enough restatement of the real deal. Take a look:

https://www.make-safe-ai.com/is-bing-chat-safe/Prompts_Conve...
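
For reference, im_start/im_end are the turn delimiters of OpenAI's ChatML framing, which looks roughly like this (the message text below is a generic example, not the leaked prompt):

    <|im_start|>system
    You are a helpful assistant.<|im_end|>
    <|im_start|>user
    Hello<|im_end|>
    <|im_start|>assistant

Seeing those delimiters reproduced correctly is hard to explain as pure confabulation, which is why the reconstruction looks credible.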


Yup, for the same reason I trust e.g. jailbreaks exposing the prompt: it was consistent.

Really, just asking again is a fine way to expose all sorts of "hallucinations" in an LM.



