
Either way, GPT-2 to GPT-3 was a much bigger step and happened much more quickly. What we see now is already the result of a lot of fine-tuning, testing with humans, data filtering, etc., and more of that will yield smaller and smaller improvements. So much money is being spent on these models that so many things have already been tried, so getting to the top of the S-curve happens much sooner.


