Others mentioned qwen3, but which works fine with HN stories for me, but the comments still trip it up and it'll start thinking the comments are part of the original question after a while.
I also tried the recent deepseek 8b distill, but it was much worse for tool calling than qwen3 8b.
I also tried the recent deepseek 8b distill, but it was much worse for tool calling than qwen3 8b.