Ban warnings fly as users dare to probe the "thoughts" of OpenAI's latest model

sippeangelo · 2024-09-19T10:54:57 1726743297

I can't claim that I have any idea of how this model is built, but from their shifty excuses touching on "alignment" I'm confident that o1 is actually two copies of the same model, one "raw" and unchained that is fine-tuned for CoT, and one that has been crippled for safety and human alignment to parse that and provide the actual reply. They have finally realized how detrimental the "lobotomizing" process is to the models general reasoning, and this is their solution. It makes sense that they are afraid to unleash that onto the world, but we've already seen the third "filter" model that summarizes the thoughts to slip some of that through (just yesterday it was seen to have "emotional turmoil" as one of the reasoning steps), so it's just a matter of time before it makes something crazy slip through.

staticman2 · 2024-09-19T17:37:08 1726767428

I'm not convinced by your argument. If this was true we would expect the unofficial "uncensored" Llama 3 finetunes to outperform the official assistant ones, which as I understand it isn't the case.

It also doesn't make sense intuitively, o1 isn't particularly good at creative tasks, and that's really the area where you'd think "censorship" would have the greatest impact, o1 is advertised as being "particularly useful if you’re tackling complex problems in science, coding, math, and similar fields."

amenhotep · 2024-09-19T20:59:35 1726779575

Uncensored finetunes aren't the same thing, that's taking a model that's already been lobotomised and trying to teach it that wrongthink is okay - rehabilitation of the injury. OpenAI's uncensored model would be a model that had never been injured at all.

I also am not convinced by the argument but that is a poor reason against.

staticman2 · 2024-09-19T22:16:35 1726784195

I'm talking about taking the Llama 3 base model and finetuning it with a dataset that doesn't include refusals, not whatever you mean by "taking a model that's already been lobotomized".

It's interesting that you weren't convinced by the above argument but still repeated the edgelord term "lobotomized" in your reply.

errantspark · 2024-09-20T00:27:49 1726792069

The claim is that llama is "lobotomized" because it was trained with safety in mind. You can't untrain that by finetuning. For what it's worth the non-instruct llama generally seems better at reasoning than instruct llama which i think is a point in support of OP.

staticman2 · 2024-09-20T14:44:03 1726843443

Better at reasoning based on benchmarks or what?

mensetmanusman · 2024-09-20T12:31:06 1726835466

That’s one hypothesis, but the honest answer is that no one knows. This technology is too new, and the effects on the knowledge graph of censoring some sub components is too complicated to currently grasp.

stuckinhell · 2024-09-19T13:49:18 1726753758

We need more open source AI models.

me_me_me · 2024-09-19T14:46:56 1726757216

or maybe the opposite

Who knows, if you are not advocating for everyone to have access to nukes

Suppafly · 2024-09-19T14:59:19 1726757959

>Who knows, if you are not advocating for everyone to have access to nukes

Is there a non-stupid way to make that sentence make sense in the context of this thread?

selfhoster11 · 2024-09-19T17:13:51 1726766031

If unstoppable corporations had literal nukes, I see no reason why it would be hypocritical to wish for private individuals to have them too.

mmh0000 · 2024-09-19T16:46:52 1726764412

Yeah! Text autogenerated from a computer's probability engine will lead to people having "wrong thoughts"!

We should ban libraries and books too! I wouldn't want people to have an opportunity to learn for themselves.

On a less sarcastic note. No, text and images can not hurt you. All of this censorship and "safety" silliness is attempted moat building that needs to stop. Thankfully, if you search around a little you can find uncensored[1] models

[1] https://ollama.com/search?q=uncensored

[2] https://ollama.com/library/llama2-uncensored

ChrisArchitect · 2024-09-19T05:35:12 1726724112

[dupe] https://news.ycombinator.com/item?id=41534474

Art9681 · 2024-09-20T02:31:44 1726799504

I am getting the "Your request was flagged as potentially violating our usage policy. Please try again with a different prompt." for a custom Golang RAG workflow that has nothing to do with OpenAI. I can send the same exact prompt to GPT-4 and it will happily respond. But if I send it to GPT-o1-mini, I always get the violation warning.

What is going on?!

57546gwg · 2024-09-19T10:56:52 1726743412

openai is yahoo in ten years, change my mind

Duximo · 2024-09-19T14:44:11 1726757051

interesting...who will be Google in this case?

xerox13ster · 2024-09-19T15:21:59 1726759319

The first team to start indexing data so it’s properly searchable again.

nojvek · 2024-09-20T00:07:51 1726790871

Perplexity

CamperBob2 · 2024-09-19T17:44:21 1726767861

Honestly, Bing is kicking Google's ass in the most basic search tasks these days, and I never thought I'd see that happen. Seeing Microsoft neglect and degrade their bread-and-butter OS while genuinely improving in search makes me feel like I woke up on the wrong side of the rabbit hole.

Some people at the top seriously need to be fired from Google. Working on advanced language models is all well and good, but not at the expense of maintaining the company's core competencies.

bigfatkitten · 2024-09-24T01:50:42 1727142642

Has Google merely neglected search, or are they actively working to make it worse?