This seems like it would signal good things for AI alignment, don't you think? It leads me to believe that if someone wanted to make a "bad" AI, they would have to build a corpus of "bad" literature: nothing but Quentin Tarantino movies, Nabokov's Lolita, GG Allin songs, and Integer BASIC programs. I doubt that would make a very useful chatbot.
[Edit: Though, I did find this recent article in Rolling Stone that includes a link to a Gab post suggesting someone managed it. TLDR: they used open source models and fine tuning. https://www.msn.com/en-us/news/technology/nazi-chatbots-meet... ]
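For anyone wondering what "open source models and fine tuning" amounts to in practice, it's a standard, widely documented workflow. Here's a minimal sketch using the Hugging Face transformers and datasets libraries; the base model ("gpt2") and the corpus file ("corpus.txt") are placeholder assumptions, not what the Gab post actually used:

    # Minimal sketch of supervised fine-tuning of an open-source causal LM.
    # Model name and corpus path are placeholders, not the Gab post's setup.
    from transformers import (
        AutoModelForCausalLM, AutoTokenizer, Trainer,
        TrainingArguments, DataCollatorForLanguageModeling,
    )
    from datasets import load_dataset

    model_name = "gpt2"  # placeholder open-source base model
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token
    model = AutoModelForCausalLM.from_pretrained(model_name)

    # The fine-tuner supplies this corpus; its contents steer the model.
    dataset = load_dataset("text", data_files={"train": "corpus.txt"})

    def tokenize(batch):
        return tokenizer(batch["text"], truncation=True, max_length=512)

    tokenized = dataset["train"].map(
        tokenize, batched=True, remove_columns=["text"]
    )

    trainer = Trainer(
        model=model,
        args=TrainingArguments(
            output_dir="ft-out",
            num_train_epochs=1,
            per_device_train_batch_size=4,
        ),
        train_dataset=tokenized,
        # mlm=False gives the standard next-token objective for causal LMs
        data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
    )
    trainer.train()

Nothing exotic here: whatever text goes into corpus.txt is what the model learns to imitate, which is exactly the "bad corpus" problem from the first paragraph.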
If you look at the examples live, you'll see that before it 'successfully' answered the way users wanted, the literal "Adolf Hitler" AI actually called out a user's antisemitism. Only after the user pushed further with a follow-up prompt saying it was breaking character did it agree.
And it's a much more rudimentary model than Grok, let alone GPT-4.
You're simply not going to get a competitively smart AI that's also spouting racist talking points. Racism is stupid. It correlates with stupid. And that's never not going to be the case.