Hacker News new | past | comments | ask | show | jobs | submit login

No, because it is not a distillation, but an extension. A selling point of the model is using RL to push past the quality of the base model.



Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: