So, HIP at a raw level is as performant as CUDA. The real problems come from hig...

So, HIP at a raw level is as performant as CUDA. The real problems come from higher level stack (BLAS, LAPACK libraries for example). But not all software need higher level stack. So, then it becomes a cost benefit analysis.

A 15k AMD part vs a 60k nvidia part. For 100 Nvidia GPUs, you can buy 200 AMD GPUs and at least 2-3 engineers for 3 years at 300k to fix the specific library for that GPU. If you can make that work for a lower level library right now, then it makes to sustain it in future.