Only the R1 671B model (aka just plain 'R1') has the censorship being discussed ...

femto · 2025-01-29T00:27:30 1738110450

Is it true to say that there are two levels of censorship at play here? First is a "blunt" wrapper that replaces the output with the "I am an AI assistant designed to provide helpful and harmless responses" message. Second is a more subtle level built into the training, whereby the output text skirts around certain topics. It is this second level that is being covered by the "1,156 Questions Censored by DeepSeek" article?

Springtime · 2025-01-29T00:43:23 1738111403

The Deepseek hosted chat site has additional 'post-hoc' censorship applied from what people have observed, if that's what you're referring to. While the foundational model (including self hosted) has some just part of its training which is the kind the article is discussing, yes.

femto · 2025-01-29T02:56:42 1738119402

Thanks for cutting through the noise. I did some poking around and a discussion from a couple of days ago reached the same conclusion.

https://news.ycombinator.com/item?id=42825573

xinayder · 2025-01-29T02:26:49 1738117609

I asked about Taiwan being a country on the hosted version at chat.deepseek.com and it started generating a response saying it's controversial, then it suddenly stopped writing and said the question is out of its scope.

Same happened for Tiananmen and asking if Taiwan has a flag.

arnaudsm · 2025-01-29T00:52:52 1738111972

I disagree, I observed censorship at the RLHF level on my local GPU, at 1.5B, 8B (llama) and 7B (qwen). Refuses to talk about Uyghurs and tiananmen 80% of the time