Hacker News new | past | comments | ask | show | jobs | submit | from login
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving (github.com/kvcache-ai)
8 points by sarkory 3 days ago | past | discuss
Show HN: KTransformers:671B DeepSeek-R1 on a Single Machine-286 tokens/s Prefill (github.com/kvcache-ai)
14 points by sssummer 71 days ago | past
Show HN: KTransformers–236B Model and 1M Context LLM Inference on Local Machines (github.com/kvcache-ai)
20 points by sssummer 7 months ago | past | 3 comments
Mooncake: A KVCache-Centric Disaggregated Architecture for LLM Serving (github.com/kvcache-ai)
13 points by zinccat 9 months ago | past

Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: