Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Even with large contexts there's diminishing returns. Just having the ability to stuff more tokens in context doesn't mean the model can effectively use it. As far as I can tell, they always reach a point in which more information makes things worse.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: