Yeah it’s a copy of o1 easier than doing SOTA work

ein0p · 2025-01-20T22:53:44 1737413624

How do you "copy" something like that if OpenAI did not disclose any of the details?

luma · 2025-01-20T23:10:27 1737414627

Use OAI to create synthetic data for your training, which is clearly what they are doing. This is why their models claim to be ChatGPT when asked.

sangnoir · 2025-01-21T10:12:20 1737454340

xAI did/does the same, but Grok is nowhere near as good. Perhaps a measure of talent is required to "copy" as well as DeepSeek.

nialv7 · 2025-01-21T00:59:45 1737421185

that's not how this works. o1's thinking trace is hidden, and that's what's valuable here, not the output.

dcreater · 2025-01-21T00:17:18 1737418638

So? Every other model maker is doing that. Including OAI

There's a lot more to making foundation models and Deepseek are very much punching well above their weight