


AFAIK Taalas, the company behind this demo, still only have their initially "hardwarized" model available to test in ChatJimmy, which IIRC is a rather stupid Llama 3-ish 8B.

Don't get me wrong though, that demo is still incredibly impressive & makes me very much excited for the hardware-based model era (potentially) ahead.

Once you've experienced those speeds, you really start to think about the whole class of things that become possible: massively parallel decode paths, extensive reasoning loops, etc.


For scale, though, if three or four chips of that size can replicate a Qwen 27B experience, that'll be quite useful.

That’s the one.

The speed is incredible and fun to see, but the model is rather weak, to the point where I’m not sure it’s particularly useful for most people.



