Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Hard to generalize from before/after performance.

While this is true, there are ways to test (open models) on tasks created after the model was released. We see good numbers there as well, so something is generalising there.





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: