Other models arent even close except for gpt 5.5. You're dead wrong on that. You...

gck1 · 2026-06-10T02:52:07 1781059927

DeepSeek 4 Pro is performing agentic SWE tasks for me quite well. It can't do everything Opus can do, but if OpenAI and Anthropic disappeared tomorrow, I'd figure out ways to make it work with harness improvements and other optimizations.

Anthropic can stretch the moat all they want, but in the department of trust, they put a final nail in their coffin today. Anthropic is pure evil at this point.

jatora · 2026-06-10T03:39:08 1781062748

'evil' lol. Every single corporation you deal with is evil then. it's greed. and almost every large model provider is guilty of it. China is all open source right now. cool! gee i wonder what would happen if they ever actually achieved SOTA? They would clamp down on that so fast Dadio's dradel would spin

AuthAuth · 2026-06-10T04:49:06 1781066946

China isnt "all open source" they still keep their top models out of the public view. Its easy to "open source" models when they're so far behind very few will pay for them.

Open source in quotes because they are not open source and not even close to open source.

jatora · 2026-06-10T18:56:16 1781117776

And what models do they keep out of public view? What ridiculous propaganda is this?!

prmoustache · 2026-06-10T05:12:04 1781068324

Can't we stop using "open-source" when it is just freeware?

SXX · 2026-06-10T05:28:03 1781069283

Open-weight is both meaningful and unique term.

gck1 · 2026-06-10T03:56:23 1781063783

> Every single corporation you deal with is evil then.

I don't know. If my ISP started MITMing my traffic so that they could silently rewrite packets, and/or deleting files on my computer because they thought me sharing wireless AP with my SO was me trying to compete with them, I'd call them evil.

I believe they tried something similar to the first one a few years ago in the US, and I remember people called that evil to the point where tech giants shut down their websites in protest.

> gee i wonder what would happen if they ever actually achieved SOTA? They would clamp down on that so fast Dadio's dradel would spin

Cool. Let them "achieve SOTA" and close down the models. Let the pendulum swing the other way.

You seem to not understand what China's goal is here. They want the AI bubble to burst and take your 401ks with it. And OAI/ANTs decisions are driving you towards that cliff.

ggoo · 2026-06-10T02:43:14 1781059394

I use gpt 5.5 at work (because they pay for it) and DeepSeek at home (because I pay for it) and while I do agree one is better than the other, I think you’re really overstating how far apart they are. Just my take.

mirsadm · 2026-06-10T06:13:44 1781072024

What's 12 months lead time worth? Not much from what I can tell. Contrary to what these AI companies might tell you, if an AI model can't do it, a human can still do the work.

gitanovic · 2026-06-10T08:48:45 1781081325

Honest question, is it possible that since might be using the latest/best model to analyze and improve the existing ones, the moat will expand exponentially, making the models better and more efficient at each iteration until there is no point in competing?

jpfromlondon · 2026-06-10T10:41:08 1781088068

All models from the past two years are close in the general case.

This is just another incremental improvement, rushed out to boost the ipo, AI has the capacity to aid an engineer but this minor bump in performance will have essentially zero impact on the productivity of an engineer working on real world solutions when compared with any other major model.

We are trending towards asymtotic and it can't happen fast enough, that's when the true cost of this will become evident.

solenoid0937 · 2026-06-10T02:36:00 1781058960

Most of HN is stuck in this fantasyland where they insist their local LLM setup is comparable to Opus 4.8 or GPT 5.5. It's like a collective delusion, I've never seen anything like it.

written-beyond · 2026-06-10T02:41:09 1781059269

You can get really good results with Chinese models. You're putting Opus and GPT on too high of a pedestal.

solenoid0937 · 2026-06-10T02:56:36 1781060196

I use Chinese models (for simple personal projects), they just don't compare to GPT or Opus for any serious work.

I do not know why every Chinese model fan thinks that people that aren't impressed by them simply don't use them.

SXX · 2026-06-10T05:36:55 1781069815

Wast majority of software engineers do very little except of moving JSONs around and building CRUDs.

It's quite obvious that when you dont try to do something particularly complex there will be literally no difference between GPT, Claude, Gemini and Deepseek.

Fot many things I'm doing in gamedev Gemini 2.5 Pro was already good enough even though it released more than year ago.

Once you pass certain threshold it's just enough.

Vetch · 2026-06-10T11:15:47 1781090147

What constitutes serious work and how seriously have you tried to do serious work with them? While those trying to claim a 30B dense model can match Opus 4.6 are engaging in either beyond over-excessive over-exaggeration or performing rather routine tasks, it's disingenuous in the other direction to claim the latest open 1T models are not useful for serious work. I find those making such claims have rarely spent more than a few minutes on halfhearted attempts and often on recently obsoleted models.

Openweight models turned a corner around kimi 2.6, deepseek v4 pro/flash, hy3 and mimo 2.5 pro. Similar to how closed LLMs turned a corner around gpt 5.2 and opus 4.5.

While they remain a step behind closed frontier models, for real world tasks ranging across functional reactive programming, distributed systems, mathematical modeling, to-the-millisecond highly optimized spatial data-structures, complex compute shaders and shader effects and non-trivial systems involving parser combinators and algebraic effect systems, I can say that open models have very recently gone from useless to productive. For my work, mimo v2.5 pro is hands down better than sonnet 4.6.

bigbadfeline · 2026-06-10T02:51:49 1781059909

Some of the new and open models are very capable now, The truth is, the value of the model is in the mind of the user - the big names are impressive to those who know little and are dazed by little, but they are bound to end up wrong regardless of how good the model is.

jatora · 2026-06-10T03:34:42 1781062482

This is ridiculous. How about the rational users who use the best current model regardless of brand? The value of the model is in the quality of the output over time. I give every major model a chance. Coding and scripts in the chat are nothing compared to the power of agentic SWEEEEEEEEE. And nothing is remotely close to claude and gpt. If you're comfortable with being well behind SOTA intelligence, then good for you, but some of us prefer to be efficient with our time and resources. With your mindset, you will never truly SWEEEEEEEEEEEEEEEEEEEE

jpfromlondon · 2026-06-10T12:08:57 1781093337

that isn't rational, rational is using the model that can best solve your current problem in the timeliest cost considered manner.

I'm not working on the frontier problems, I don't need god-in-a-box for $600 per month.

jatora · 2026-06-10T16:45:56 1781109956

its not god in a box and its not $600 per month

and almost nobody is working on frontier problems. they just want frontier intelligence to solve their given problems in a superior manner.

you're minimizing and exaggerating all of the wrong things. cope more i guess - more compute for us!

jpfromlondon · 2026-06-10T21:39:37 1781127577

Your comment makes it pretty clear that mine went over your head and that's fine, these tools are for people like you, godspeed.