What you can run locally are the distilled models, that is actually LLama and Qwen weights further trained on R1's output
reply