ha, exactly... like, the % change could be minuscule (or worse, it might only be a perceived difference, the actual quality may have regressed, or the scenario just didn't lend itself to that specific model) but people will be on here proclaiming that they're now shipping 10x the number of PRs.
for me at least, yes. just wrote it to coworkers this afternoon. Behaves way more "stable" in terms of quality and i don't have the feeling of the model getting way worse after 100k tokens of context or so.
What i notice: after 300k there's some slight quality drop, but i just make sure to compact before that threshold.
reply