CUDA is a bit of a well-trodden ground, you aren’t going to do much better there...

neonsunset · 2024-05-18T19:39:05

My understanding is it's less about competing with cuBLAS and cuDNN directly but rather offering the features they expose in a better and more idiomatic way - there's a reason it's less fun and more tedious to write C++ AMP code.

ein0p · 2024-05-18T20:29:00

Why would anyone write C++ AMP code when AMP is deprecated, and e.g. Triton exists though?