Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
MoonshotAI unveils Kimi's large-scale LLM serving architecture (arxiv.org)
18 points by slothfulhamster on July 2, 2024 | hide | past | favorite | 1 comment


I have been wondering the reason why online generative AI can serving so many requests. This really gives me an explanation.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: