It just seems odd to me that it's not given an incentive to communicate this.

Surely humans using it would find great value in knowing the model's confidence, or whether it thinks it's confabulating.

These services are created to give the best product to users, and so wouldn't this be a better product? Therefore there is incentive. Happier users and a product that is better than competitors.




Go read through any mass of training data and count how often "I don't know" appears. It's going to be very small. Internet fora are probably the worst because people who are aware that they don't know usually refrain from posting.
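
If you want to actually eyeball that, a crude count makes the point. A rough sketch in Python; the corpus file name and the hedge phrases here are my own made-up assumptions, not anything specific:

    # Crude, hypothetical sketch: how often a text dump hedges with "I don't know".
    # The file name and phrase list are assumptions for illustration only.
    import re

    HEDGES = [r"\bi don'?t know\b", r"\bnot sure\b", r"\bno idea\b"]

    def hedge_rate(path):
        text = open(path, encoding="utf-8", errors="ignore").read().lower()
        hits = sum(len(re.findall(p, text)) for p in HEDGES)
        sentences = max(1, len(re.findall(r"[.!?]", text)))
        return hits / sentences  # hedges per sentence, a very rough proxy

    print(hedge_rate("forum_dump.txt"))  # expect a number close to zero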


>These services are created to give the best product to users, and so wouldn't this be a better product? Therefore there is incentive. Happier users and a product that is better than competitors.

Why would the computation care about any of that? I'm talking about incentive for the model.


Incentive for the model is to survive RLHF feedback from contract workers who are paid to review LLM output all day. They're paid for quantity, not quality. Therefore, the optimum strategy is to hallucinate some convincing lies.
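
Roughly the mechanism, for anyone who hasn't looked at how RLHF works: a reward model is fit on pairwise rater preferences (Bradley-Terry style), and the policy is then tuned to maximize that reward. A toy sketch, with invented numbers, of what falls out when raters skimming for throughput prefer confident-sounding answers over hedged ones:

    # Toy Bradley-Terry sketch. Rewards below are invented purely to illustrate
    # the claim: if raters prefer confident answers, so does the learned reward.
    import math

    def preference_prob(r_chosen, r_rejected):
        # P(chosen beats rejected) = sigmoid(reward difference)
        return 1.0 / (1.0 + math.exp(-(r_chosen - r_rejected)))

    reward = {"confident_but_wrong": 1.2, "hedged_but_correct": 0.4}  # made up
    p = preference_prob(reward["confident_but_wrong"], reward["hedged_but_correct"])
    print(f"modelled P(rater picks confident-but-wrong) = {p:.2f}")  # ~0.69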


Why are they paid for quantity, not quality, though?

Sounds like it's a choice by the model creators, then, since they could instruct their testers to reward quality.


How would that work? Quantity is easy to measure. Quality is not.


Doesn’t the model want to make the user happy?

Its responses sure seem like it does.

I’d be happier with its responses if it were honest about when it isn’t confident in its answer.


Go look at the first link I sent. Rewarding for "making users happy" destroys GPT-4's calibration.

Why would "making users happy" incentivize for truth ?


Because getting truthful answers would make users happier?

Seems like common sense to me.

Who’s asking the chatbot questions and not looking for or wanting a truthful answer, most of the time?

If the model understood or captured “human interest” at all in its training, this should be pretty fundamental to its behavior.


Yes, the computer wants you to be happy. Happiness is mandatory. Failure to be happy is treason.


"I'm talking about incentive for the model. "

In Douglas Adams' Hitchhiker's Guide to the Galaxy, this is (somewhat) fixed by giving the AIs emotions...



