They could have been stored at the client, and encrypted before optionally synce...

nl · 2025-11-13T00:27:12 1762993632

Signal does End-to-end encryption, so they (Signal) can never read it.

The whole point of ChatGPT conversations is so they can be read by the model on the server.

Conversations are kept around because they can be picked up and continued at any point (I use this feature frequently).

Additionally you can use conversations in their scheduled notification feature, where the conversation is replayed and updates are sent to you, all done on the server.

> OpenAI made a choice in how the feature was and is implemented.

Indeed they did, and it was a sensible choice given how the conversations are used.

godelski · 2025-11-13T04:03:02 1763006582

You could definitely do this E2EE.

Models should run in ephemeral containers where data is only processed in RAM. For active conversation a unique and temporary key-pair is generated. Saved chats are encrypted client side and stored encrypted server side. To resume a conversation[0], decrypt client side, establish connection to container, generate new temporary key-pair, and so on. There's more details and nuances but this is very doable.

How Mullvad handles your data, for some inspiration: https://mullvad.net/en/help/no-logging-data-policy

  > Conversations are kept around because they can be picked up and continued at any point (I use this feature frequently).

I'm not sure why this is a problem. There's no requirement that data at rest needs be unencrypted. Nor is there a requirement that those storing the data need to have the keys to decrypt that data. Encrypted storage is a really common thing...

  > Additionally you can use conversations in their scheduled notification feature, where the conversation is replayed and updates are sent to you, all done on the server.

For this we can use the above scenario, or we can use a multi-key setting if you want to ping multiple devices, or you can have data temporarily decrypted. There is still no need to store the data to disk unencrypted or encrypted with keys OAI owns.

Of course, I also don't see OAI pushing the state of Homomorphic Encryption forward either... But there's definitely a lot of research and more than acceptable solutions that allow data to be processed server side while being encrypted for as long as possible and making access to that data incredibly difficult.

Again, dive deep into how Mullvad does it. It is not possible for them to make all their data encrypted, but they make it as close to impossible to get, including by themselves. There doesn't need to be a perfect solution, but there's no real reason these companies couldn't restrict their own access to that data. There's only 2 reasons they are not doing so. Either 1) they just don't care enough about your privacy or 2) they want it for themselves. Considering how OpenAI pushes the "Scale is All You Need" narrative, and "scale" includes "data", I'm far more inclined to believe the reason is option 2.

[0] Remember, this isn't so much a conversation in the conventional sense. The LLMs don't "remember". You send them the entire chat history in each request. In this sense they are Markovian. It's not like they're tuning a model just to you. And even if they were, well we can store weights encrypted too. Doesn't matter if a whole model, LoRA, embeddings, or whatever. That can be encrypted at rest via keys OAI does not have access to.

fenomas · 2025-11-13T12:48:07 1763038087

Services like Mullvad and Signal are in the business of passing along messages between other parties; messages the service isn't a party to. With chatgpt chat histories, the user is talking directly to the service - you're suggesting the service should E2EE messages to and from itself, to prevent itself from spying on data generated by its own service?

N-Krause · 2025-11-13T12:07:43 1763035663

You cannot compare these examples. There is currently no way to encrypt the user message and have the model on the server read/process the message without it being decrypted first.

Mullvad and E2EE Messengers do not need to process the contents of the message on their server. All they do is, passing it to another computer. It could be scrambled binary for all they care. But any AI company _has_ to read the content of the message by definition of their service.

subscribed · 2025-11-13T18:51:43 1763059903

It's a solved problem. Lumo.

N-Krause · 2025-11-14T06:51:01 1763103061

Lumo never promises encryption while processing a conversation on their servers. Chats HAVE to be decrypted at some point on the server or send already decrypted by the client, even when they are stored encrypted.

Read the marketing carefully and you will notice that there is no word about encrypted processing, just storage - and of course that's a solved problem, because it was solved decades ago.

The agent needs the data decrypted, at least for the moment, I know of no model that can process encrypted data. So as long as the model runs on a server, whoever manages that server has access to your messages while they are being processed.

EDIT: Even found an article where they acknowledge this [0]. Even though there seems to exist models/techniques that can produce output from encrypted messages with 'Homomorphic Encryption' [1], it is not practical, as it would takedays to produce an answer and it would consumes huge amounts of processing power.

[0] https://proton.me/blog/lumo-security-model

[1] https://en.wikipedia.org/wiki/Homomorphic_encryption

pmontra · 2025-11-13T14:07:27 1763042847

> Models should run in ephemeral containers where data is only processed in RAM

Maybe, but letting aside that they are two different kind of products, how can you trust them to really do so? And in any way, in the case of ChatGPT where should I store my client side private key, as I use those bots only in my web browser? Maybe in my password manager and I copy paste it every time I start a new conversation.

My take is that if they went this way we would not be talking about them now, we would be talking about one of their competitors that didn't put hurdles between their product and their customers.

In other words, survivor bias.

nl · 2025-11-14T09:59:06 1763114346

As it happens...

I built E2E encrypted LLMs using secure enclaves, so I know a bit about this space.

The tech works, for small LLMs - the sort of thing you can run on your mobile already. It isn't yet (?) there for LLMs the size of ChatGPT.

cush · 2025-11-13T00:34:41 1762994081

People are responding in this thread as if ChatGPT is a one-on-one conversation with another person. The data isn’t “shared” with OpenAI. You’re chatting with OpenAI. ChatGPT is just a service. There’s no way to use ChatGPT without sharing all of your chats with OpenAI, that’s what the entire product is.

cush · 2025-11-13T00:27:08 1762993628

This doesn’t sound realistic. Signal is end to end encrypted and only sends one message at a time, while ChatGPT needs the entire chat context for every message and they need to decrypt your messages in their services in order to feed them into the LLM.

vorticalbox · 2025-11-13T11:30:39 1763033439

this is what proton are doing with lumo[0]

https://lumo.proton.me

thorum · 2025-11-12T22:23:58 1762986238

> Our long-term roadmap includes advanced security features designed to keep your data private, including client-side encryption for your messages with ChatGPT. We believe these features will help keep your private conversations private and inaccessible to anyone else, even OpenAI.

JCM9 · 2025-11-12T23:18:24 1762989504

This sort of thing is pretty trivial to implement from the start, they just chose not to because they wanted the data themselves

cruffle_duffle · 2025-11-13T01:06:59 1762996019

Hah. I seriously doubt it is even close to trivial. Especially when they are to exist on any device you use the service from.