Hacker News
up6w6 on July 30, 2024 | on: Kagi LLM Benchmarking Project
It's very audacious to call it "almost perfect" when it appears to have only 50 questions. For comparison, MMLU covers 57 tasks and roughly 16k questions.