Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
cerebras: 450 tokens/sec llama 3.1 70B (theregister.com)
7 points by davidfiala 10 months ago | hide | past | favorite | 2 comments



Cerebras fails the "how many r's in strawberry" test. Grok is the only one who passed that test.

Going to be interesting to see the speed and accuracy keep increasing, cant imagine how fast/accurate things will be in a decade. Cant wait.


- 1,800tps on llama 3.1 8B

- 450tps on llama 3.1 70B

free chat interface is at: https://inference.cerebras.ai (requires login)




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: