Hacker News
Cerebras Inference now runs Llama 3.1-70B at 2100 tokens/s (cerebras.ai)
6 points by cs-fan-101 8 months ago