Hacker Newsnew | past | comments | ask | show | jobs | submit | dmk's commentslogin

The benchmarks are cool and all but 1M context on an Opus-class model is the real headline here imo. Has anyone actually pushed it to the limit yet? Long context has historically been one of those "works great in the demo" situations.

Paying $10 per request doesn't have me jumping at the opportunity to try it!

The only way to not go bankrupt is to use a Claude Code Max subscription…

Yeah, just had to upgrade to Max 20x yesterday because of hitting the limits every day and the extra usage gets expensive very fast.

Makes me wonder: do employees at Anthropic get unmetered access to Claude models?

It's like when you work at McDonald's and get one free meal a day. Lol, of course they get access to the full model way before we do...

Boris Cherny, creator of Claude Code, posted about how he used Claude a month ago. He’s got half a dozen Opus sessions on the burners constantly. So yes, I expect it’s unmetered.

https://x.com/bcherny/status/2007179832300581177


Seems quite obvious that they do, within reason.

Don't most jobs have unmetered access? I know mine does

Opus 4.5 starts being lazy and stupid at around the 50% context mark in my opinion, which makes me skeptical that this 1M context mode can produce good output. But I'll probably try it out and see

Has a "N million context window" spec ever been meaningful? Very old, very terrible, models "supported" 1M context window, but would lose track after two small paragraphs of context into a conversation (looking at you early Gemini).

Umm, Sonnet 4.5 has a 1m context window option if you are using it through the api, and it works pretty well. I tend not to reach for it much these days because I prefer Opus 4.5 so much that I don't mind the added pain of clearing context, but it's perfectly usable. I'm very excited I'll get this from Opus now too.

If you're getting on along with 4.5, then that suggests you didn't actually need the large context window, for your use. If that's true, what's the clear tell that it's working well? Am I misunderstanding?

Did they solve the "lost in the middle" problem? Proof will be in the pudding, I suppose. But that number alone isn't all that meaningful for many (most?) practical uses. Claude 4.5 often starts reverting bug fixes ~50k tokens back, which isn't a context window length problem.

Things fall apart much sooner than the context window length for all of my use cases (which are more reasoning related). What is a good use case? Do those use cases require strong verification to combat the "lost in the middle" problems?


Living in the EU, I'm skeptical any of this happens. Our leaders have been pretty reluctant to push back on anything so far and most of these assets are private anyway.


Wouldn't this be done by individual institutions and countries, not all once by "the EU?"

Evidence of that:

> Danish pension fund divesting US Treasuries

https://news.ycombinator.com/item?id=46692594


That's a tiny barely significant amount, though.

However the amount of US treasuries Denmark holds but privately and publicly did decrease by 20% or so over the last yea which I guess is something..


Fair point. Though I wonder if individual fund moves actually move the needle here or if it's mostly symbolic until it becomes a trend.


I believe that the best political speech of our time has just been presented. [0]

I believe that you might be a fellow European. If you happen to have 30 minutes to listen, I would love to hear your feedback.

[0] https://www.youtube.com/live/dE981Z_TaVo?t=100s


Hi Troy, just wanted to let you know that I just sent you an email! :)

Also, just to be sure, I sent it to on-board.ai domain as well, as that seemed like the correct website (onboard.ai just showed "for sale" page). Might help some others too.


Wow, looks amazing, will definitely apply!

Just FYI, the link next to the Founding Engineer is leading to Founding Creator instead of the Founding Engineer: https://mitteai.notion.site/Founding-Engineer-254f3cdf01fb80....


Looks like the correct URL is https://jobs.ashbyhq.com/PlantingSpace.


Google login also seems to be having issues, multiple people reported to me that the login isn’t working and they’ve been logged out of their Google accounts.


Yes, I tried logging in today in two distinct Google accounts on separate Chrome profiles and it would sign me out in about ~ 5 seconds after logging in. And the login process was very sluggish.


This is great! This brings me back to my childhood, I loved this game, really looking forward to trying it out.


> Tens of thousands of people each year receive a series of shots to prevent rabies after a possible exposure. It normally costs between $1,200 and $6,800. Not in this case.

Even that "normal" cost is absolutely crazy.


Most Americans don't realize how bad they have it. They've grown accustomed to being punched and slapped around and so won't rebel. They'll keep giving all of what little and shrinking of what they have to legitimized criminals.


How? They don't have internet?

Even in Zurich, Switzerland, one of the most expensive places and healthcares in the world, it's 85 CHF / shot:

https://reisemedizin.uzh.ch/en/pre-travel_advice/rabies

Emergency room is 50 CHF extra in non-emergency cases:

https://www.thelocal.ch/20210617/emergency-room-visits-to-no....


We do have Internet, but we've gotten used to being told that the Internet lies to us. We've been repeatedly told that people wait months for treatment in the UK, and that Canadians are streaming over the border to get health care in America.

We read horror stories like this one, but say "Whew, glad that won't happen to me." We imagine that because of capitalism, if our insurance company screws us over, we'll change to the next one -- freedom we wouldn't have if we had a national health care.

It never seems to occur to us that all of the private insurers have a capitalism-driven goal of maximizing profits, and national insurers don't.


> Stable Diffusion? Well, that’s free-ish. Problem is, you’ll probably want to run it locally, which requires a really, really beefy graphics card. I was struggling to run it on a Vega56 - a GPU that goes for ~$150 used now - so I went out and got a RTX3090 for about $1,000. If you’re already a gamer with a GPU with 8Gb+ of VRAM you’re probably good, but for most people this is a bit absurd.

I agree with the problem, on platforms like AWS you'll even need to send a manual request so they would let you use instances which can run SD. On the other hand there's already something like replicate.com, which allows you to run the SD like an API. I hope there will be more services like this.

https://replicate.com/blog/run-stable-diffusion-with-an-api


I'm a bit surprised they don't mention Snap anywhere on the site except for a link in footer (something like "by Snap"). Generally I would expect much better experience from a single page website promoting one product.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: