It's going to be interesting to see the speed and accuracy keep increasing. I can't imagine how fast and accurate things will be in a decade. Can't wait.
- 450 tokens/s on Llama 3.1 70B
Free chat interface: https://inference.cerebras.ai (requires login)