Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Cerebras Inference now runs Llama 3.1-70B at 2100 tokens/s (cerebras.ai)
6 points by cs-fan-101 8 months ago | hide | past | favorite



Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: