Hacker News new | past | comments | ask | show | jobs | submit | from login
Refusals (LLM Leaderboard) (mandoline.ai)
2 points by kmckiern 8 days ago | past | discuss
Comparing Refusal Behavior Across Top Language Models (mandoline.ai)
2 points by kmckiern 15 days ago | past
Show HN: Mandoline – Custom LLM Evaluations for Real-World Use Cases (mandoline.ai)
2 points by kmckiern 57 days ago | past

Consider applying for YC's W25 batch! Applications are open till Nov 12.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: