Hacker News new | past | comments | ask | show | jobs | submit login
Show HN: Kevin-32B – how to do multi-turn RL on writing CUDA kernels (cognition.ai)
7 points by silasalberti 49 days ago | hide | past | favorite
Hey – we just published a blog post about Kevin-32B = K(ernel D)evin.

It's to our knowledge the first open-source model that's RL-trained on CUDA kernels. Our goal was to demonstrate multi-turn RL using GRPO. We used 180 Python->CUDA conversion tasks from the KernelBench dataset.

The results were surprisingly strong! We were able to outperform top reasoning model like o3 & o4-mini.

We're sharing our training setup and learnings in the blogpost. Also the model is on HuggingFace: https://huggingface.co/cognition-ai/Kevin-32B




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: