Hacker News
AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models (github.com/ruixiangcui)
2 points by accrual 6 months ago


