Right, but RLHF mostly reinforces answers that people prefer. Even if you don't believe sentience is possible, it shouldn't be a stretch to believe that sentience might produce answers people prefer. In that case, it wouldn't need to be an explicit training goal — it could be selected for indirectly.