Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It seems like you might need less output tokens for the same quality of response though. One of their plots shows o3 needing ~14k tokens to get 69% on SWE-bench Verified, but GPT-5 needing only ~4k.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: