Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Batched reward model inference and Best-of-N sampling (raw.sh)
34 points by rawsh 8 months ago | past
Teaching LLMs to solve chess puzzles with DSPy and Finetuning (raw.sh)
1 point by rawsh 10 months ago | past
Teaching chat models to solve chess puzzles (raw.sh)
4 points by rawsh 11 months ago | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: