Tried. The context windows just weren't big enough.

coder543 · 2026-06-15T19:06:28 1781550388

Qwen3.6-27B supports a 1 million token context window.

Of course, you have to have the right hardware to be able to run with a context window like that, as it takes about 100GB of memory on my DGX Spark to do that with full f16 KV cache on the q4_k_xl model.

lysace · 2026-06-15T17:20:55 1781544055

Got a similar result (my RTX 4070 only has 12 GB). I'm curious about whether 24/32 GB meaningfully improves this enough to make it useful.

tobyhinloopen · 2026-06-15T17:41:48 1781545308

Try it on RAM and CPU.

It’s slower but you can run them.

lysace · 2026-06-15T18:25:19 1781547919

Good idea for evaluating the models, thanks.

deadbabe · 2026-06-15T17:06:52 1781543212

Prompt more directly instead of open ended.