Hacker Newsnew | past | comments | ask | show | jobs | submit | bburtenshaw's commentslogin

thanks for sharing the work. correct, we're currently working on evals for skills so you can compare skills between models and harnesses.

we wrote a blog on getting agents to write CUDA kernels and evaluating them: https://huggingface.co/blog/upskill


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: