
But AlphaGo etc. don’t use any kind of language-based AI, so LLMs (which this thread was about) are no good.


The next step seems to be applying past advances in reinforcement learning with modern transformer-based models.


Which multiple teams are working on: OpenAI (Q*), and Meta, which just released a reinforcement learning framework.


Could you point me towards Meta's reinforcement learning framework? I'd like to see how it stacks up against OpenAI Gym.



Thank you!



