Submissions from raw.sh

		Batched reward model inference and Best-of-N sampling (raw.sh)
		34 points by rawsh 8 months ago \| past
		Teaching LLMs to solve chess puzzles with DSPy and Finetuning (raw.sh)
		1 point by rawsh 10 months ago \| past
		Teaching chat models to solve chess puzzles (raw.sh)
		4 points by rawsh 11 months ago \| past