Hacker Newsnew | past | comments | ask | show | jobs | submit | AustinCarrBW's commentslogin

You're misreading. The article is referring to V3 when it cites the base model behind R1. It does not say R1 is the base model.


This is in the Businessweek story


The article specifically says "it's likely this sum referred only to the final training run—a data-refinement process that transforms a model’s previous prototypes into a complete product—but many people perceived it as an insanely low budget for the entire project." The article also delves into the SemiAnalysis report, as well as denials from ex-DeepSeek employees.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: