Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Language Models Learn to Mislead Humans via RLHF (arxiv.org)
3 points by Anon84 on Nov 2, 2024 | hide | past | favorite | 1 comment


Anticipated by Stanislaw Lem in his History of Intellectronics.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: