Agreed. The biggest takeaway from how much Anthropic puts into alignment, only to still end up with a model that can do things that are clearly misaligned, should be that alignment is very tricky.