Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Others mentioned qwen3, but which works fine with HN stories for me, but the comments still trip it up and it'll start thinking the comments are part of the original question after a while.

I also tried the recent deepseek 8b distill, but it was much worse for tool calling than qwen3 8b.



Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: