The latest models create information from base models by randomly creating candi...

esafak 38 days ago | parent | context | favorite | on: OpenAI says it has evidence DeepSeek used its mode...

The latest models create information from base models by randomly creating candidate responses then pruning the bad ones using an evaluation function. The good responses improve the model.

It is not distillation. It's like how you can arrive at new knowledge by reflecting on existing knowledge.