More

scriptsmith · 2026-05-10T21:22:42 1778448162

I've got some demos of what the new Prompt API in Chrome that uses a local model can do: https://adsm.dev/posts/prompt-api/#what-could-you-build-with...

As OP says, it shines in constrained environments where the model is transforming user-owned data. Definitely less useful for anything more open-ended.

2ndorderthought · 2026-05-10T21:26:22 1778448382

Yea I do not recommend treating chromes prompt API as a good example of local LLMs. It's fine and stuff but it's really weak. 8b models from a year ago are better in some ways. And a lot of the recent model drops are meaningfully better.

scriptsmith · 2026-05-10T21:30:04 1778448604

It's based on a Gemma 3n model, and yeah it's not the best. But if you have a use case that needs constrained JSON output for example, it's pretty neat.

Maybe it would do better with the new Gemma 4 models, which the Chrome devs have been hinting at moving to. And why the API doesn't let you introspect / pick the model, I'm still not sure.

robot-wrangler · 2026-05-10T23:17:29 1778455049

> I've got some demos of what the new Prompt API can do: > Use surrounding context to rewrite your ad copy:

Yup, that's the plan. No local model, no webpage; more, better and cheaper adtech extortion/surveillance for vendors while everyone else pays for the juice and hardware degradation.

dakolli · 2026-05-10T21:46:15 1778449575

So you're running an llm to do data transformation that deterministic processes would be much better suited for and running 1,000 watt power supply to do so. Wild.

scriptsmith · 2026-05-07T22:41:41 1778193701

Chromium mostly does not support this, because it doesn't have the binary blob required to run the inference. However, it does still download the model weights and expose the LanguageModel API, because that part is hooked up.

https://adsm.dev/posts/prompt-api/#which-browsers-support-th...

Packagers might eventually disable that but I tested this behaviour in chromium 148 a few hours ago, and it would download the weights but has trouble running them.

scriptsmith · 2026-05-06T23:23:56 1778109836

> My understanding is that you’ll have to explicitly agree to download alternate models in the future, per the specification

I don't get that from the spec. Closest thing I could find in the Writing and Assistance APIs spec [1] (which the Prompt API refers to) says:

> [It] allows the user agent to prompt the user for permission

But that's talking more about initiating the 'download' process more than choosing the model.

I think Chrome wants the browser to be 'opinionated' about the model used however, rather than letting the webpage pick, at least for now.

[1] https://webmachinelearning.github.io/writing-assistance-apis...

scriptsmith · 2026-05-06T00:13:55 1778026435

Chromium doesn't support this API because it needs a binary blob to run the inference, although in theory it may still be configured to download the weights:

https://adsm.dev/posts/prompt-api/#which-browsers-support-th...

scriptsmith · 2026-05-05T21:58:53 1778018333

Author here. After trying out the Prompt API over the last week, I wrote up some details on the chromium internals, how to use the API, and made some toy demos.

It's a 4 GB model that can be used to run on-device inference.

scriptsmith · 2026-05-05T08:26:02 1777969562

If Chrome has the #optimization-guide-on-device-model and #prompt-api-for-gemini-nano flags enabled, either because it's part of some Origin Trial / Early Stable Release or something, then web pages will have access to the new Prompt API which allows any webpage to initiate the (one-time) download of the ~2.7 GiB CPU or ~4.0 GiB GPU model using LanguageModel.create()

https://developer.chrome.com/docs/ai/prompt-api

When Chrome 148 releases tomorrow, this will be the default behaviour on desktop.

To download, it should check for 22 GiB free disk space on the volume where your Chrome data dir is, and at least double the model size of free space in your tmp dir.

21asdffdsa12 · 2026-05-05T14:59:34 1777993174

First the tabs came for the RAM and i did not protest, for i had plenty. Then they came for the chip and i did not protest, for it was dark silcon anyway. Then they came for the HDD.

oaiey · 2026-05-05T20:02:14 1778011334

And then they made the ram and ssd so expensive :)

bearjaws · 2026-05-05T20:03:32 1778011412

I am curious if it reuses the LLM across all tabs, hard to imagine most machines can boot up 1-2 of any 4gb model unless its a more powerful system.

nsvd2 · 2026-05-05T21:05:09 1778015109

I think it obviously will, what would be the benefit to spinning up more than one copy?

wtallis · 2026-05-05T21:45:34 1778017534

It should only need to load one copy of the weights, but each tab/site will need a separate context and KV cache.

doctorpangloss · 2026-05-05T18:12:04 1778004724

Okay, but the browser is basically the computer for most people.

underlipton · 2026-05-05T15:28:30 1777994910

Told ya.

maxloh · 2026-05-05T14:31:49 1777991509

The more severe problem is that Google installs model weight files on a per-user basis, meaning Chrome occupies 4 more GB of space for every OS user on your device.

bityard · 2026-05-05T14:49:09 1777992549

The company I work at has several environments and hundreds of VDI users in each environment. Chrome is the default browser in all of them. By my rough napkin math, this one small change by Google will eat up at least 15 terabytes of new disk space in total. (I sure hope we are using deduplication at the physical storage layer...)

throwway120385 · 2026-05-05T14:59:33 1777993173

It's fine. Network and disk space are free, right?

charcircuit · 2026-05-05T19:55:08 1778010908

Compared to human labor it is.

account42 · 2026-05-06T09:08:03 1778058483

Only because those who can save on the labor are not paying for the increased resource use in the first place.

niutech · 2026-05-09T16:07:49 1778342869

Serious question: why do you use Google Chrome in the first place, when there are better alternatives, e.g. Brave (crypto stuff could be easily disabled) or Vivaldi, both with adblocker?

tbrownaw · 2026-05-05T23:53:35 1778025215

Shouldn't the filesystem be set to encrypt everything before it hits the physical storage layer?

dburkland · 2026-05-05T19:37:57 1778009877

Thankfully deduplication is a thing ;)

Pay08 · 2026-05-05T16:00:16 1777996816

I certainly hope you don't automatically update.

TheRealDunkirk · 2026-05-05T19:27:58 1778009278

Does your place review every line of every update patch note? Do you think you would catch this implication?

BiteCode_dev · 2026-05-05T18:15:56 1778004956

Does each playwright (or similar automation system) count as a different user, and does it keep the model around ?

If yes, it's an interesting API to call when a AI crawler hit your website.

emegeve83 · 2026-05-06T12:36:17 1778070977

For every profile.

ai-x · 2026-05-05T16:15:21 1777997721

4GB, $0.10 (whatever the HD price) that is the equivalent of a High School level intelligent brain that can perform many cognitive tasks (and in the future even PhD level intelligence) for free?

Oh, the horror!!!

Wait, let me pay my HVAC guy $500 he deserved because he came all the way from his home to replace a fuse

recursive · 2026-05-05T16:42:54 1777999374

It doesn't make sense to apply wholesale prices for mass storage. People are running Chrome on specific devices that they already own. Storage is not fungible in this way.

beedeebeedee · 2026-05-05T16:59:57 1778000397

If you’re pissed you had to pay your HVAC guy to drive to your house and do something you think is trivial, why didn’t you do it yourself?

account42 · 2026-05-06T09:12:01 1778058721

As the saying goes, gp didn't pay $500 to have the fuse replaced, he paid $500 for the training and experience that was required to know that the fuse had to be replaced.

froggit · 2026-05-05T20:31:56 1778013116

> 4GB, $0.10 (whatever the HD price) that is the equivalent of a High School level intelligent brain that can perform many cognitive tasks for free?

This is better than my current solution of an actual human with masters degreed intelligence performing all my cognitive tasks for free how? I mean, i'm the first to admit i'm extremely lazy and even i'm over here like "really??"

hansmayer · 2026-05-05T18:40:30 1778006430

> Wait, let me pay my HVAC guy $500 he deserved because he came all the way from his home to replace a fuse

Right, because its totally something an LLM can do, right?

verisimi · 2026-05-05T17:37:53 1778002673

Here is your google brain on your device, whether you want it or not.

mcmcmc · 2026-05-05T18:12:51 1778004771

I don’t think you understand what “free” means

SkiFire13 · 2026-05-05T17:45:47 1778003147

Tell that to Apple, I'm sure they will allow me to pay $0.025/GB for additional storage on my Macbook /s

tayo42 · 2026-05-05T18:05:57 1778004357

It's annoyingly imposible to add more disk space to laptops. I think mine is soldered.

account42 · 2026-05-06T09:13:57 1778058837

Apple laptops maybe. In many others it's just a normal M.2 NVMe module behind a screwed on bottom case plate.

sheept · 2026-05-05T15:16:45 1777994205

You can already trigger a 2 GB model download with the Summarizer API[0], which is already shipped in Chrome.

    Summarizer.create()

[0]: https://developer.chrome.com/docs/ai/summarizer-api#model-do...

I think this is a distinct model from the Prompt API, since the other shipped AI APIs use fine tuned models.

rafram · 2026-05-05T17:55:43 1778003743

Both of them say they use Gemini Nano.

crumpled · 2026-05-05T16:36:27 1777998987

So now we're up to 6 GB

entropicdrifter · 2026-05-05T19:28:30 1778009310

Per user

Vinnl · 2026-05-06T13:19:57 1778073597

Also note the Mozilla standards position on this API: https://github.com/mozilla/standards-positions/issues/1213#i...

Or this summary on its status:

> Mozilla: Opposed

> WebKit: Opposed

> Microsoft: Several concerns

> W3C TAG: Several concerns

> Developers: Mostly negative

From https://mastodon.social/@jaffathecake/116527007495775507

ddtaylor · 2026-05-05T20:05:32 1778011532

The problem is that some of us are still on connections that charge per GB in rural areas. Here in Montana it's very common to pay about $0.25 per GB regardless of how much you use, so this is a $1 additional cost per desktop device. Places like public school districts have hundreds of computers and this will be somewhat significant for them.

marklubi · 2026-05-06T03:05:42 1778036742

I was thinking a similar thing. Many of our customers have purpose use computers that rarely see physical infrastructure internet, but need a modern browser (many chose Chrome on their own, we never recommended it).

They're going to get blasted with cellular data charges when they fire up their computer in the field.

McGlockenshire · 2026-05-05T23:28:41 1778023721

Google's updater service also currently ignores the windows 11 metered connection hint. It will gladly download that model over your cell connection even if you have a data cap.

This is infuriating behavior.

Silicon Valley must wake up and understand the entire world does not live like them.

bjelkeman-again · 2026-05-06T19:11:59 1778094719

They live in a bubble and not a lot of the surrounding world makes it in to them. I know it is hyperbolic, but I lived there for a while and I stand by that opinion.

wuschel · 2026-05-05T09:07:52 1777972072

It is a small model, so what utility can I / Google expect from it? What is the on-board model used for?

2ndorderthought · 2026-05-05T10:03:14 1777975394

It's not a very good small model to be honest.

That said, you might be surprised to learn that some of the models from 3b-9b could probably replace 80% of the things nonvibe coders use chatgpt for.

Its a good idea to run small models locally if your computer can host them for privacy and cash saving reasons. But how can you trust Google to autoinstall one on your machine in 2026? I just couldn't do it.

imglorp · 2026-05-05T11:29:58 1777980598

Sure, local models good and yes, there's no way we can trust Google.

We can be positive the entire motivation of Chrome is user behavior surveillance. There's not a nano-chance in all the multiverses that Chrome model is doing anything privately. They've gone to extraordinary length to accomplish this. It's not for free.

reactordev · 2026-05-05T12:59:40 1777985980

It is entirely about user surveillance as well as pushing their product on to their users because they have the install base. Google Chrome has become Microsoft IE6 in hostile user behavior.

aftbit · 2026-05-05T13:12:59 1777986779

You either die a hero or live long enough to see yourself become a villain.

What did we expect when they dropped "don't be evil" from their company values?

reactordev · 2026-05-05T13:29:53 1777987793

A claim about as useful then as it is now. They never wanted to be anything but, once Sergei left. The Schmidt era had them publicly declare one thing while doing something else entirely behind the curtain.

coldtea · 2026-05-05T13:44:44 1777988684

They were corporate evil from day 1. The rest was just PR slogans, and playing the good guy as long as you don't need to squeeze profits.

xnx · 2026-05-06T04:33:45 1778042025

If Google were focused on surveillance, why haven't they been collecting keystroke data (like grammarly) for years?

philip1209 · 2026-05-05T14:06:43 1777990003

Isn’t it really “pushing a feature to their products”?

reactordev · 2026-05-05T14:40:42 1777992042

Not when you are appropriating 2GB or more of space for that feature.

akoboldfrying · 2026-05-05T11:43:09 1777981389

I don't trust them either, but the same Google makes Gemma 4 available to run as locally and privately as you want, and those models are pretty amazing for their size.

imglorp · 2026-05-05T16:15:30 1777997730

Both can be true: they give a nice local model so you find it useful AND the chrome harness captures every token in and out for exfiltration.

marcta · 2026-05-05T21:26:30 1778016390

LLMs are costing Google a ton of money in compute and storage right now. If they can farm any of that off to the users, it makes economical sense.

But yes, there is a 100% chance that logs will get sent back to Google too.

imglorp · 2026-05-06T10:16:11 1778062571

> farm

Ooh, this is interesting. There's nothing stopping them from sending jobs down to local machines. That's some 3 billion nodes. We went through this with coin mining and spam botting.

Nothing stopping it except your ire if it's discovered.

Ajedi32 · 2026-05-05T13:45:42 1777988742

> But how can you trust Google to autoinstall one on your machine

Why are AI models something I'd be uniquely unable to trust Google to install, compared all the other code included in Chrome updates? Is your point just that you shouldn't trust Chrome in general?

2ndorderthought · 2026-05-05T19:12:41 1778008361

Yes I would not trust Google or chrome. They have a history of class action lawsuits for doing shady things to users. Enabling them to condense data on your machine and transmit it however they want, should they choose too is suspect to me.

jm4 · 2026-05-05T19:12:25 1778008345

Google is probably still sucking up the contents of your LLM requests even with the model running locally.

jauntywundrkind · 2026-05-05T15:37:04 1777995424

Yeah, so unclear why yer again everyone is so quickly running for the pitchforks & torches. The model doesn't do anything, it's just a sandbox.

I'm really tired of such overinflated ridiculousness shrillness against Google. Yes there are very real tensions to this company and their as business is scary as heck.

But folks don't seem capable of processing duality, don't seem to be able to do much but ad-hominem until they pass out. Its really so exhausting having such empty energy charging in every single time, and it keeps obstructing any ability to think straight or assess.

jcgrillo · 2026-05-05T21:10:56 1778015456

> The model doesn't do anything, it's just a sandbox.

Doesn't that make it worse? They forced everyone to download 4GB of crap for nothing. They could have done one of two things:

(1) bundle the model with the application so you can tell ahead of time you're signing up for 4GB of bandwidth usage or

(2) make downloading the model some kind of opt-in thing.

Either of those would have worked. Just because you can easily tolerate 4GB of unplanned bandwidth usage doesn't mean everyone who can't is wrong.

pbmonster · 2026-05-05T18:33:36 1778006016

I was waiting for Google to pull a local LLM onto Chrome/Android devices. It opens up some revenue streams that weren't easily possible before: for example the often memed "I was talking about cigars with my wife one single time and now all I see are adsense ads for cigars" gets much easier with a local model doing speech to text and topic classification.

froggit · 2026-05-05T20:36:46 1778013406

> Yeah, so unclear why yer again everyone is so quickly running for the pitchforks & torches.

Cause everyone loves a good bonfire and a fresh hot roast.

thegrim33 · 2026-05-05T19:16:19 1778008579

The point is that what you're "sick of" isn't actually authentic human thought, but in reality you're responding to a recent european-driven propaganda campaign with the goal of deriding anything and everything related to US tech.

wildrhythms · 2026-05-05T13:42:56 1777988576

All that matters is some MBA product manager at Google was celebrated for shipping this. Hooray!

elphinstone · 2026-05-05T14:34:26 1777991666

Everyone who implemented or approved this should be prosecuted under the Computer Fraud and Abuse Act (18 U.S.C. § 1030). If I was on a jury, I wouldn't hesitate to send them to prison where they belong.

hluska · 2026-05-05T18:55:03 1778007303

A fair and impartial jury is a fundamental part of freedom. I genuinely cannot believe that we have been reduced to wanting to destroy the jury system to punish companies we don’t agree with. At this point, this is less activism and more weaponized disrespect for fundamental freedoms.

ahupp · 2026-05-05T19:45:07 1778010307

What is the principle you’re using here?

soco · 2026-05-05T11:31:11 1777980671

Which is why I uninstalled Chrome a (short...) while ago and my life went on unbothered.

raddan · 2026-05-05T15:48:55 1777996135

I am amused when people fret about not using Chrome. I get it but… I have literally NEVER used Chrome. Perhaps I just don’t know what I am missing but the web seems to work just fine for me without it?

Danox · 2026-05-05T16:58:40 1778000320

Touché…

tsss · 2026-05-05T12:32:19 1777984339

Half of the reason to use local AI is to circumvent the censorship that Google, OpenAI and so on have. I don't want this Google crap on my computer.

safety1st · 2026-05-05T17:20:36 1778001636

> That said, you might be surprised to learn that some of the models from 3b-9b could probably replace 80% of the things nonvibe coders use chatgpt for.

Really? I'm a total amateur when it comes to doing anything with local models but I tried a few in this range using ollama at this point, and they didn't seem to know much about anything, and I couldn't figure out how to get them to search the web or run other tools, so that was where the experiment ended.

A small local model that can use bash would be a bit of a game-changer for me.

hluska · 2026-05-05T19:01:13 1778007673

Local models are improving quickly so if you keep an eye open you’ll find something soon enough. But from experience, I’ll warn you that local models can lose the plot very quickly. Their little self arguments when they get stuck usually come down to:

- It failed? This must be a mistake, I’ll try it again. It failed? This must be a mistake, I’ll try it again because then I will complete the task (repeat about every six seconds until you rescue it).

- You know, the best way to deal with a permissions problem is to erase the entire system. That’ll definitely solve those pesky permissions and I’ll complete the task.

svachalek · 2026-05-05T19:42:12 1778010132

The latest small models are now reliable enough at simple tools like web search I think. It's just afaik none of the user friendly harnesses like ollama or LMStudio have a real one-click setup flow for this. You'll need to download models and do a fair bit of tool configuration.

xnx · 2026-05-06T04:36:02 1778042162

Gemini CLI can use bash and run on the Gemma local model.

scriptsmith · 2026-05-05T09:25:25 1777973125

It's based on Gemma 3n, and it's not the best.

I find it works fine for simple classification, translation, interpretation of images & audio. It can write longer prose, but it's pretty bad.

It can also write text in the format of a JSON schema or regexp for anything you might want to do with structured data.

Wowfunhappy · 2026-05-05T11:22:00 1777980120

I wonder why they’re using Gemma 3 and not Gemma 4?

scriptsmith · 2026-05-05T11:32:40 1777980760

Google has been trialling the Prompt API in chrome for the over a year, so before Gemma 4 existed. But they are indicating they'll move to Gemma 4: https://groups.google.com/a/chromium.org/g/blink-dev/c/iR6R7...

dotancohen · 2026-05-05T11:41:31 1777981291

So that the big news in non-tech news sites will be the update. Thus ensuring that this is received in a positive light.

andy_ppp · 2026-05-05T11:29:10 1777980550

It'll probably update to that without telling you at some point.

kevincox · 2026-05-05T14:10:44 1777990244

I find models of this size (not tested this one specifically) at being very good at simple data extraction from user input. Think about things like parsing date and time of an event from a description or parsing a human-typed description of a repeating event rule.

rmac · 2026-05-05T16:49:29 1777999769

this is considered a large model. i think you might be surprised how many "small" models chrome has already pulled down on your disk.

but to answer your question: one of the services that uses a small model: PermissionsAIv4

""" Use the Permission Predictions Service and the AIv4 model to surface permission notification requests using a quieter UI when the likelihood of the user granting the permission is predicted to be low. Requires `Make Searches and Browsing Better` to be enabled. – Mac, Windows, Linux, ChromeOS, Android """

michaelbuckbee · 2026-05-05T11:45:45 1777981545

I ran a fairly large production test of this and on _every_ measure except for privacy it was worse than a free tier server hosted LLM.

Not happy about that as I would like to see more local models but that's the current state of things.

https://sendcheckit.com/blog/ai-powered-subject-line-alterna...

gchamonlive · 2026-05-05T12:42:59 1777984979

> on _every_ measure except for privacy it was worse than a free tier server hosted LLM

Would you be able to compare this to other local models in it's class and a above that would fit consumer-grade hardware?

accrual · 2026-05-05T13:57:41 1777989461

> It is a small model, so what utility can I / Google expect from it?

Precedence for shipping models alongside consumer software.

Potentially without consent if it truly is a silent install.

hightrix · 2026-05-05T15:54:54 1777996494

Something to do with serving more ads. My guess is they will use this to “better target” or to drain more information from you for their ads.

tobylane · 2026-05-05T08:37:23 1777970243

Those two (and more) exist in chrome://flags in Chrome 147. I'm disabling them now, with the expectation that will prevent the new default.

One option I'm leaving as default is "Use LiteRT-LM runtime for on-device model service inference." Any comment on that?

RaiausderDose · 2026-05-05T11:24:22 1777980262

I'm on Chrome 147 too and disabled:

"optimization-guide-on-device-model"

- Enables optimization guide on device

"prompt-api-for-gemini-nano"

- Prompt API for Gemini Nano

- Prompt API for Gemini Nano with Multimodal Input

and deleted weights.bin and the 2025.x folder in "OptGuideOnDeviceModel"

Will report if Chrome 148 downloads the model again.

phs318u · 2026-05-05T11:38:50 1777981130

If you touch those files into existence and chown to root and chmod to 0, it shouldn’t be able to ever overwrite them right?

sethops1 · 2026-05-05T19:52:23 1778010743

You want to use chattr +i (make the empty file immutable)

pmontra · 2026-05-05T11:48:03 1777981683

I'm on my phone now so I can't check if something has changed, but what you want to protect from change is the directory, not the files. A file can be deleted and created again if the process can write the directory.

RaiausderDose · 2026-05-05T11:44:18 1777981458

yeah, should work. Will try readonly on windows too.

Now I can't see it anymore, but shouldn't the model be under chrome://on-device-internals/ -> model-status?

Maybe you can uninstall there too.

Markoff · 2026-05-05T12:10:56 1777983056

thanks, went to flags in Vivaldi and just in case disabled all flags containing "gemini" and first five results for "model"

beaugunderson · 2026-05-05T15:11:30 1777993890

maybe I was on the wrong side of the early release but I’ve deleted this model many times in the last year. I’ve had it for at least 12 months.

RaiausderDose · 2026-05-08T16:07:06 1778256426

it downloaded the model again...

scriptsmith · 2026-05-05T08:46:52 1777970812

Those flags will exist already, but will default to enabled in 148.

That other flag is for using a different open-source inference engine to the (from what I can tell) closed-source one that's used by default.

Twirrim · 2026-05-05T21:54:22 1778018062

Searching about:flags for model comes up with a whole bunch:

#omnibox-ml-url-scoring-model

#omnibox-on-device-tail-suggestions

#optimization-guide-on-device-model

#text-safety-classifier

#prompt-api-for-gemini-nano

#writer-api-for-gemini-nano

#rewriter-api-for-gemini-nano

#proofreader-api-for-gemini-nano

#summarizer-api-for-gemini-nano

#on-device-model-litert-lm-backend

Then around gemini but not caught by the search for models: #skills (maybe? I think this is implied by "gemini in chrome"?)

edit: I don't see a carte blanch AI disabling option. As much as I dislike Mozilla's growing obsession with AI, at least they give me a top level option to disable all AI stuff. I only keep Chrome around for occasional testing reasons.

d3Xt3r · 2026-05-05T21:10:30 1778015430

So my understanding of that is that the download happens only when sites call the Prompt API right?

Because my Chrome stable has been updated to v148 now, and I don't see any AI models in my user profile folder. My profile size is only 328 MB, with the Code Cache subfolder occupying the most space (135 MB).

scriptsmith · 2026-05-05T21:15:24 1778015724

In my understanding, yes. I wrote a blog post about some of the internals here: https://news.ycombinator.com/item?id=48028662

codethief · 2026-05-05T18:49:09 1778006949

Next step: Invoke the prompt API from within online ads and run a "p2p" AI inference provider which forwards incoming LLM queries to website visitors. :-)

scriptsmith · 2026-05-06T04:23:34 1778041414

I wrote a more detailed blog post here:

https://news.ycombinator.com/item?id=48028662

jimmaswell · 2026-05-05T16:49:45 1777999785

This sounds perfectly reasonable. No objection from me.

jadbox · 2026-05-05T23:40:35 1778024435

I believe webpages that use the API must request from the user via a system permissions dialogue to aces the prompt API, according the docs a few months ago.

scriptsmith · 2026-05-05T23:46:59 1778024819

It can only be called after the user has interacted with the page, but there's no dialogue from the browser

https://developer.chrome.com/docs/ai/get-started#user-activa...

arendtio · 2026-05-06T19:38:37 1778096317

/dev/mapper/vg_system-arch 207G 192G 4,7G 98% /

Just don't keep free space around :-D

madduci · 2026-05-06T07:49:31 1778053771

Do you know if also Chromium has thesenfkags enabled?

scriptsmith · 2026-05-06T08:06:37 1778054797

Depends on where you get it. By default the flags will be enabled, but some packagers may choose to disable them. I haven't seen a major distro release chromium 148 yet.

Weirdly though, chromium won't be able to actually use the model even though it can download it, because the inference engine is a closed-source blob.

https://adsm.dev/posts/prompt-api/#which-browsers-support-th...

hzwanip · 2026-05-06T19:45:07 1778096707

I think it's great, LFG Chromium OSS

scriptsmith · 2026-03-10T05:13:19 1773119599

Yes, you could turn it around to say that using Anthropic models in Cursor, Copilot, Junie, etc. is 'subsidising' Claude Code users.

scriptsmith · 2026-03-07T12:14:16 1772885656

The "First-class syntactic selection" reminds me of my most used shortcut(s) in Jetbrains IDEs: the Expand / Shrink Selection.

  Ctrl + W
  Ctrl + Shift + W

https://www.jetbrains.com/help/idea/working-with-source-code...

It really changed my perspective on interacting with the 'text' of a file.

VS Code, Zed, etc. have similar operations, but in my experience they expand and shrink too coarsely.

jasonjmcghee · 2026-03-07T16:27:05 1772900825

Mine are:

Cmd+Shift+V - Stacked clipboard, you can start typing to search or hit a number to choose what to paste (keeps everything you've copied/cut inside jetbrains for a while)

Cmd+Shift+E - Recent locations, you can start typing to search - shows little buffers of where you've been recently

Cmd+Shift+A - Action tab of the command palette - fuzzy search for any command (really the only shortcut you need, other than maybe Shift+Shift for main command palette shortcut)

--- Through the Action bar...

Local History / Local History of Selection - you can start typing to search quite far back the history of all changes of the current file or selection - you can also right click a folder or the project and do the same. Much finer grained than git.

The general concept of being able to search for something and edit directly in the buffer of the search results.

the__alchemist · 2026-03-07T16:32:33 1772901153

Hero! I had not done my homework/have not been aware, but these all look fantastic! The stacked clipboard is something I periodically mentally complain about (Why is clipboard on every OS/tool I've used single item?)

I will add one that are possibly more well-known:

  - ctrl + shift + F: Find text in any file
  - ctrl + N: Find types (structs, classes etc)
  - ctrl + shift + N: Find any file by name or path

leoc · 2026-03-07T17:07:09 1772903229

Windows has had a pretty usable stacking clipboard for a while! You just have to activate it. Since you can pin thing into it it’s also quite useful as a rough and ready way to type special characters you use frequently.

the__alchemist · 2026-03-07T17:29:07 1772904547

Wow. Looks to not even be a PowerToys feature. Win + V

jasonjmcghee · 2026-03-07T20:29:02 1772915342

I use shift+shift and type for all three of these.

Although i do use cmd+shift+r for global replace

JadeNB · 2026-03-08T03:06:34 1772939194

> Why is clipboard on every OS/tool I've used single item?

The wonderful Flycut (https://apps.apple.com/us/app/flycut-clipboard-manager/id442...) fixes this on macOS.

hhhAndrew · 2026-03-07T16:11:14 1772899874

Mathematica is the earliest thing I am aware of with this feature where it was Alt+. to expand selection in their notebook interface starting in the early 90s. But the thing I miss most that I still can't shake the muscle memory of after almost a decade of not using much Mathematica, is that single/double/triple/n-click scaled this way as well. So double-click selected a whole word (as in all editors), triple-click selected all the comma-separated multiple args of a function, 4-click for f(a,r,g,s), and so on.

lejalv · 2026-03-07T19:07:27 1772910447

Ctrl-Space in TeXmacs, where the document is an actual tree (http://texmacs.org)

enkursigilo · 2026-03-07T12:22:24 1772886144

Also available as `incremental selection` in Neovim via tree-sitter.

hudsonwillis · 2026-03-07T16:52:08 1772902328

yep and most of these actions can be implemented with simple mappings based on https://github.com/nvim-treesitter/nvim-treesitter-textobjec...

gritzko · 2026-03-07T18:00:53 1772906453

I work on AST based revision control. I have a stack of ideas on how to achieve the same Ctrl+W effect with commits/diffs/cherry-picks. All still in flux. If you have some thoughts to share, please do.

[1]: https://github.com/gritzko/librdx/tree/master/be#readme

chamomeal · 2026-03-07T15:06:44 1772896004

I use it constantly in helix too. The vscode one is meh. I think I saw a discussion in github once about switching to tree-sitter, which would improve AST-related actions. I don't think it went anywhere though.

I love AST aware editing. I think it's one reason it's always been so nice to edit lisps. Stuff that is complicated to describe in javascript (and doesn't have LSP support) p much requires a whole AST parser, but in lisp it's just a simple list operation. When I go back typescript after a weekend of clojure, I reeaally miss slurp! and other paredit commands

cyberax · 2026-03-07T22:30:32 1772922632

JetBrains also experimented with AST-based editing: https://www.jetbrains.com/help/mps/fast-track-to-mps.html#st...

An overview video: https://www.youtube.com/watch?v=XGm_khXZl44

I tried it, but it just was too clumsy. Sometimes refactoring/editing needs to go through phases where the AST is invalid, and MPS makes that just too clumsy.

But with AI this might be a different story.

mystifyingpoi · 2026-03-07T14:00:43 1772892043

I agree, top feature. Combined with things like "extract method" makes mundane refactorings super fast.

layer8 · 2026-03-07T15:30:52 1772897452

Yes, Java IDEs have had these since sometime in the 2000s.

exidex · 2026-03-07T16:00:25 1772899225

And I am sure that people have been complaining about the hand gymnastics you have to do to press those shortcuts since around that time as well

lkjdsklf · 2026-03-07T18:29:05 1772908145

It’s still some time in the 2000s and will be for the next 974 years

layer8 · 2026-03-07T22:35:58 1772922958

Not according to common usage: https://en.wikipedia.org/wiki/2000s

the__alchemist · 2026-03-07T16:47:27 1772902047

To me, it feels like Zed and VsCode perform most operations in a general way on the text; they don't seem to (in Python and Rust at least) have an understanding of the code structure in the way JB does. (And based on some digging on Ki the way it does as well?) So, I would bet they are using that text-based model, which would be hit/miss here.

joquarky · 2026-03-07T21:23:27 1772918607

One problem is I got so used to Ctrl-W that I use it in other applications and usually wind up inadvertently closing the tab.

tzot · 2026-03-07T23:46:37 1772927197

I had this issue too, so I remapped Ctrl-W/Shift-Ctrl-W to Ctrl-\/Shift-Ctrl-\ . (Also git operations became two-key sequences, starting with Ctrl-G and that damn Ctrl-K stopped being the shortcut for commit.)

parallax_error · 2026-03-07T13:31:54 1772890314

This has got to be my favourite feature of IntelliJ, along with the dumb context actions menu

upcoming-sesame · 2026-03-07T16:06:25 1772899585

Cool feature. what's the key for it in visual studio code?

lolpython · 2026-03-07T16:14:35 1772900075

Expand Selection: Alt+Shift+→ (Windows/Linux) or Option+Shift+→ (Mac) Shrink Selection: Alt+Shift+← (Windows/Linux) or Option+Shift+← (Mac)

the__alchemist · 2026-03-07T16:14:53 1772900093

Ty! I have been missing out. Adding this to the repertoire.

hurflmurfl · 2026-03-07T16:26:25 1772900785

Ah yes.

The shortcut I use the most in Jetbrains IDEs. Also the one I miss the most in VSCode (whatever is present there just doesn't seem to work right).

Also the shortcut that has caused me to close so many browser tabs inadvertently...

scriptsmith · on Oct 24, 2024

Yes, I've used the v3.2 3B-Instruct model in a Slack app. Specifically using vLLM, with a template: https://github.com/vllm-project/vllm/blob/main/examples/tool...

Works as expected if you provide a few system prompts with context.

scriptsmith · on Aug 5, 2024

To keep on-top of tabs in Firefox, I use 'Auto Tab Discard' [1] to discard tabs after a certain amount of inactivity. Then when I need to clean up my list of tabs, I click on any discarded tabs I want to keep, and then use my extension 'Close Discarded Tabs' [2] to clear the rest.

[1] https://addons.mozilla.org/en-US/firefox/addon/auto-tab-disc...

[2] https://addons.mozilla.org/en-US/firefox/addon/close-discard...

EasyMark · on Aug 5, 2024

Firefox does a "good enough" job of flushing tab memory on it's on. You don't really need these extensions.