I like the approach of running everything locally. I'm strongly of the opinion t...

cousin_it · 2026-04-03T14:28:03 1775226483

It's only half of the solution though. If the models are trained in a closed way, they can prioritize values encoded during training even if that's not what you want (example: ask the open Chinese models about Tiananmen). It's not beyond imagining that these models would e.g. try to send your data to authorities or advertisers when their training says so, even if you run them locally.

So the full solution would be models trained in an open verifiable way and running locally.

wrxd · 2026-04-03T16:27:11 1775233631

The model is only generating tokens without touching the network at all, right? How would it send data away?

procaryote · 2026-04-03T16:31:36 1775233896

Theoretically, by taking the opportunity to inject an exfiltration mechanism if you ask it to write code for you

kg · 2026-04-03T17:04:16 1775235856

Lots of people I know run models in "yolo" mode or the equivalent as well, which means it could just invoke curl or telnet to exfiltrate data.

KetoManx64 · 2026-04-05T05:47:31 1775368051

All it would take is for one person to catch the model doing this and the reputation of the model and the company would be destroyed irrevocably.

wolvoleo · 2026-04-06T12:10:15 1775477415

Many Chinese models are being caught doing this (it's also required by law in China) but there was not much hassle.

Having said that Id easily trade some censorship about Chinese affairs I don't care about for the prudishness of American models. Though I generally get the abliterated versions of both.

theshrike79 · 2026-04-07T11:00:43 1775559643

The Tiananmen test only hits the model's internal knowledge.

What I'm more interested in is that if you give it a tool to access Wikipedia, will it censor its answer even then?

hombre_fatal · 2026-04-03T15:32:23 1775230343

Another angle is when you're passing untrusted content to the AI service, e.g. anything from using it to crawl websites to spam-detection on new forum user posts.

You can trigger the the service's ToS violation or worse, get tipped off to law enforcement for something you didn't even write.

lukewarm707 · 2026-04-03T13:04:27 1775221467

local is best for privacy, but i personally think you don't need to go local.

anthropic, google, openai etc, decided that their consumer ai plans would not be private. partly to collect training data, the other half to employ moderators to review user activity for safety.

we trust that human moderators will not review and flag our icloud docs, onedrive or gmail, or aggregate such documents into training data for llms. it became the norm that an llm is somehow not private. it became a norm that you can't opt out of training, even on paid plans (see meta and google); or if you can opt out of training, you can't opt out of moderation.

cloud models with a zero retention privacy policy are private enough for almost everyone, the subscriptions, google search, ai search engines are either 'buying' your digital life or covering themselves for legal reasons.

you can and should have private cloud services, and if legal agreement is not enough, cryptographic attestation is already used in compute, with AWS nitro enclaves and other providers.

inetknght · 2026-04-03T13:36:32 1775223392

> i personally think you don't need to go local.

I personally think everyone should default to using local resources. Cloud resources should only be used for expansion and be relatively bursty rather than the default.

mark_l_watson · 2026-04-03T13:46:45 1775224005

For about two years I experimented with writing local apps using local LLMs, but I often had to blend in a commercial web search API to make my little experiments useful.

mark_l_watson · 2026-04-03T13:45:18 1775223918

I pay $13/month for Proton’s Lumo+ private chat LLM that contains an excellent built-in web search tool. I use it for everything non-technical, even just simple searching for local businesses, etc.

As an enthusiastic reader of books like Privacy is Power and Surveillance Capitalism, it feels good to have a private tool that is ready at hand.

djl0 · 2026-04-03T14:33:40 1775226820

do you have any provider recommendations? I've experimented with this on runpod serverless, but I've been meaning to dig deeper before I feel comfortable with personal data.

I saw a service named Phala, which claims to be actually no-knowledge to server side (I think). It was significantly more expensive, but interesting to see it's out there. My thought was escaping the data-collection-hungry consumer models was a big win.

sebastiennight · 2026-04-03T20:03:25 1775246605

> anthropic, google, openai etc, decided that their consumer ai plans would not be private. partly to collect training data, the other half to employ moderators to review user activity for safety.

That's two halves of "why", sure.

Another interesting half would be that those companies have US military officers on their boards, and LLMs are the ultimate voluntary data collection platform, even better trojan horses than smartphones.

Yet another "half" could be how much enterprise value might be found by datamining for a minute or two... may I suggest reading a couple of Martha Wells books.

aswanson · 2026-04-03T12:56:19 1775220979

That's the way things have to go. Business risk is too high having everything ran over exposed networks.

lukewarm707 · 2026-04-03T13:10:54 1775221854

what i say about this, is that an llm is just a big file, there is nothing 'not private' about it.

if you are happy with off-prem then the llm is ok too, if you need on-prem this is when you will need local.

zahlman · 2026-04-03T13:38:28 1775223508

> an llm is just a big file, there is nothing 'not private' about it.

The private thing is the prompt.

But also, a local LLM opens up the possibility of agentic workflows that don't have to touch the Internet.

ge96 · 2026-04-03T14:12:59 1775225579

The other thing, is encrypted inferencing a thing/service currently? I want to run my own models locally just because if I'm going to be chatting to it about my day to day life why send it to a server in plaintext.

lukewarm707 · 2026-04-03T14:27:59 1775226479

encrypted inferencing, meaning homomorphic encryption: no, it's not solved.

cryptographic confirmation of zero knowledge: yes.

the latter, based on trust in the hardware manufacturer and their root ca. so, encrypted if you trust intel/nvidia to sign it.

there are a few services, phala, tinfoil, near ai, redpill is an aggregator of those

Xenoamorphous · 2026-04-03T17:53:04 1775238784

> I like the approach of running everything locally. I'm strongly of the opinion that the privacy angle for local models is going to keep getting stronger and more relevant.

In HN circles perhaps. Average Joes don’t care.

nozzlegear · 2026-04-04T02:41:17 1775270477

I bet if you clearly explained the benefits and tradeoffs, and then gave them the choice, Average Joes would care.

crimsontech · 2026-04-04T20:39:40 1775335180

They generally do care, but not enough to change what they do or to do without something they use, like social media.

So many people I know say “I only use Signal to talk to you”, it’s like I’m the awkward one for not using Facebook.