Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Ask HN: Did Google know about RLHF(breakthru) only after OpenAI shared
1 point by elarocks 3 months ago | hide | past | favorite | 1 comment
I am still unable to think of a reason why Google, Anthropic and AWS did not know about/invest in RLHF, before OpenAI shared their success around implementing and in scale that was viable. Would you say that if OpenAI had not shared about RLHF, Google and Anthropic wouldn't be where they are today ?


RLHF is basically a fancy, overengineered GAN. Most of the industry could see that DPO was more efficient for fitting to human behavior.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: