Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
nitwit005
20 hours ago
|
parent
|
context
|
favorite
| on:
Some critical issues with the SWE-bench dataset
If you know some way to get people to volunteer millions of dollars of free labor, there are better uses of their time than evaluating LLMs.
Join us for
AI Startup School
this June 16-17 in San Francisco!
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: