
> Blows away any consumer GPU.

Nah. Do you have first-hand experience with Strix Halo? At less than 1600€ for a 128GB configuration, it manages >45 tokens/s with gpt-oss 120b, which is faster than DGX Spark at a fraction of the cost.



Strix Halo has awful token prefill speed. It's only suitable for very small contexts.



