Hacker News
Show HN: I built "AI Wattpad" to eval LLMs on fiction (narrator.sh)
32 points by jauws 8 days ago | 32 comments
Show HN: Evaluating LLMs on creative writing via reader usage, not benchmarks (narrator.sh)
36 points by jauws 6 months ago | 12 comments
