Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Very audacious to call it "almost perfect" when it has only what appears to be 50 questions. For comparison, MMLU contains 57 tasks and more than 100k questions.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: