Maybe due to 2x slower matmul (well known bottleneck), as one of multiple factor... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

Lockal on May 29, 2024 | parent | context | favorite | on: Tinygrad 0.9.0

Maybe due to 2x slower matmul (well known bottleneck), as one of multiple factors? https://github.com/tinygrad/tinygrad/blob/v0.9.0/extra/gemm/...

There are multiple bounties just for it in https://docs.google.com/spreadsheets/d/1WKHbT-7KOgjEawq5h5Ic...

anthonix1 on May 29, 2024 [–]

I think the matmul issue is symptomatic of a much deeper issue.

It would be nice to see less whining and blaming AMD (PyTorch and llm.c actually work on 7900 XTX, and blow tiny grad out of the water in terms of perf!), and more just getting stuff to work.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact