Hacker News
How did OpenAI train ChatGPT to be so (cloyingly) wholesome?
2 points by logicallee on Dec 23, 2022 | 4 comments
Anyone who has interacted at all with ChatGPT will notice it always delivers a very wholesome message. Here someone asked it to write about rotten milk:

https://old.reddit.com/r/ChatGPT/comments/ztpqsw/didnt_expect_to_cry_at_a_poem_about_rotten_milk/

and it ends on the note "friendships, love and trust can too, be forgot / So let us be careful, in all that we do / To not let the milk of life turn rotten too."

This is the end of a poem about rotten milk.

How did OpenAI train ChatGPT to be so cloyingly wholesome in all its responses, always bringing it back to a positive message?




Reinforcement learning combined with real humans adjusting the model's output. It's not just a bunch of text they gathered being mashed together by AI. There's an extremely strong human touch to this model.
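The commenter is describing what OpenAI calls reinforcement learning from human feedback (RLHF): human labelers rank model outputs, a reward model is trained on those rankings, and the language model is then optimized against that reward. A minimal sketch of the reward-model step is below; the features, data, and linear model are all made up for illustration (the real reward model is a large neural network over full text), but the pairwise logistic (Bradley-Terry) loss is the standard preference-learning objective.

```python
import math

# Toy sketch of the preference-modeling step in RLHF.
# Hypothetical features [positivity, coherence] stand in for real text;
# the reward model here is just a linear function for illustration.

def reward(weights, features):
    # Linear reward score: dot(weights, features)
    return sum(w * f for w, f in zip(weights, features))

def preference_loss(weights, chosen, rejected):
    # Bradley-Terry / logistic loss: -log sigmoid(r_chosen - r_rejected).
    # Low loss means the model scores the human-preferred answer higher.
    margin = reward(weights, chosen) - reward(weights, rejected)
    return math.log(1 + math.exp(-margin))

def sgd_step(weights, chosen, rejected, lr=0.1):
    # One gradient step: d(loss)/d(margin) = -1 / (1 + exp(margin)),
    # and d(margin)/d(w_i) = chosen_i - rejected_i.
    margin = reward(weights, chosen) - reward(weights, rejected)
    g = -1 / (1 + math.exp(margin))
    return [w - lr * g * (c - r) for w, c, r in zip(weights, chosen, rejected)]

# Made-up example: labelers preferred the first completion over the second.
chosen, rejected = [0.9, 0.8], [0.1, 0.7]
w = [0.0, 0.0]
for _ in range(100):
    w = sgd_step(w, chosen, rejected)

# After training, the preferred completion gets the higher reward.
assert reward(w, chosen) > reward(w, rejected)
```

The policy model is then fine-tuned (typically with PPO) to produce outputs that this learned reward model scores highly, which is how stylistic preferences of the labelers end up baked into every response.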


So it was rewarded/reinforced every time it brought things back to a positive message at the end?


Not necessarily trained only to be positive, but yes, your assumption is right. I don't know the exact details, but I think some OpenAI employees have discussed this on their personal blogs.

If I can find the links over the weekend I will make another reply.


Thanks, I haven't seen any of their blogs and would love a link.



