Hacker News new | past | comments | ask | show | jobs | submit login
Alibaba releases an 'open' challenger to OpenAI's O1 reasoning model (techcrunch.com)
121 points by bn-l 15 days ago | hide | past | favorite | 7 comments



Bigger discussion from two days ago: https://news.ycombinator.com/item?id=42259184


Comments moved thither. Thanks!


full list of chinese o1 clones

- QwQ https://huggingface.co/collections/Qwen/qwq-674762b79b75eac0...

- Marco-o1 by AIDC, Alibaba (yes this is a different, less known team also from alibaba) https://huggingface.co/AIDC-AI/Marco-o1

- Skywork-o1 by Kunlun Tech https://huggingface.co/collections/Skywork/skywork-o1-open-6...

- DeepSeek-R1-Lite-Preview https://chat.deepseek.com

- InternThinker preview by Shanghai AI lab https://sso.openxlab.org.cn/login?redirect=https://internlm-...

- k0-math by Moonshot AI https://kimi.moonshot.cn

https://x.com/adinayakup/status/1861908631807017007?s=46

the main ones to watch are QwQ and r1.


Interesting take. Basically Open AI's O1 and every reasoning model's behaviour can be replicated by proper COT given that it is followed correctly. I learn COT from this amazing blog on I found on Reddit and then experimented and LLM was able to perform much better. So I believe Alibaba in their new model must have improved their system prompt when compared to their new model.

Blog Link: https://hub.athina.ai/blogs/what-is-chain-of-thought-prompti...



Why the scare quotes if it's apache 2.0?


The github readme has something unusual re licensing:

> All our open-source models, except for the 3B and 72B variants, are licensed under Apache 2.0.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: