New Nvidia driver adds 16bit float support to OpenCL [pdf]

Kelteseth · 2024-02-15T10:16:34 1707992194

Can someone explain why they updated to Clang 7, which was released on 19 September 2018?

kevingadd · 2024-02-17T06:35:21 1708151721

It's very very common for software like shader compilers or AOT compilers for other languages to be based on old clang/llvm forks.

peppermint_gum · 2024-02-17T09:30:29 1708162229

It doesn't seem to be new. The release notes for the 511.79 driver from February 2022 say the same thing:

https://us.download.nvidia.com/Windows/511.79/511.79-win11-w...

pixelpoet · 2024-02-17T10:25:27 1708165527

It was previously opt-in (which I wasn't even aware of); now it's on by default / rolled out to everyone.

zbendefy · 2024-02-15T09:38:13 1707989893

It seems they are finally enabling the 16-bit float extension in their OpenCL implementation.
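For anyone wanting to check this on their own machine, here is a minimal sketch. It assumes the extension in question is the Khronos `cl_khr_fp16` extension and that you already have the device's extension string (the space-separated list OpenCL reports for `CL_DEVICE_EXTENSIONS`). The helper name `has_extension` and the sample string are hypothetical, made up for illustration.

```python
def has_extension(extensions: str, name: str) -> bool:
    """Return True if `name` is a whole token in a space-separated extension string.

    Matching whole tokens avoids false positives from names that merely
    share a prefix with the one we are looking for.
    """
    return name in extensions.split()


# Hypothetical extension string for illustration; not copied from a real driver.
sample_extensions = "cl_khr_global_int32_base_atomics cl_khr_fp64 cl_khr_fp16"

if has_extension(sample_extensions, "cl_khr_fp16"):
    print("half-precision arithmetic is advertised")
else:
    print("half-precision arithmetic is not advertised")
```

On the kernel side, code that uses the `half` type for arithmetic would normally also carry `#pragma OPENCL EXTENSION cl_khr_fp16 : enable`.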

Aardwolf · 2024-02-17T08:37:45 1708159065

Is this the IEEE float16 type or the bfloat16 type? It doesn't say.

my123 · 2024-02-17T17:56:03 1708192563

pixelpoet · 2024-02-17T10:25:59 1708165559

https://en.wikipedia.org/wiki/Half-precision_floating-point_...
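The linked article covers IEEE 754 half precision (binary16). As a rough illustration of how that differs from the bfloat16 format asked about above, here is a small sketch using only the Python standard library. The function names are hypothetical, and the bfloat16 conversion simply truncates a float32 (real hardware usually rounds), so treat it as an approximation rather than a reference implementation.

```python
import struct


def float_to_half_bits(x: float) -> int:
    """Round x to IEEE 754 binary16 and return the 16-bit pattern."""
    return struct.unpack("<H", struct.pack("<e", x))[0]


def half_bits_to_float(bits: int) -> float:
    """Decode a binary16 bit pattern back to a Python float."""
    return struct.unpack("<e", struct.pack("<H", bits))[0]


def float_to_bfloat16_bits(x: float) -> int:
    """Keep the top 16 bits of the float32 encoding (truncation, no rounding)."""
    return struct.unpack("<I", struct.pack("<f", x))[0] >> 16


def bfloat16_bits_to_float(bits: int) -> float:
    """Decode a bfloat16 bit pattern by padding it back out to float32."""
    return struct.unpack("<f", struct.pack("<I", bits << 16))[0]


def split_fields(bits: int, exp_bits: int, frac_bits: int):
    """Split a 16-bit pattern into (sign, exponent, fraction) fields."""
    sign = bits >> (exp_bits + frac_bits)
    exponent = (bits >> frac_bits) & ((1 << exp_bits) - 1)
    fraction = bits & ((1 << frac_bits) - 1)
    return sign, exponent, fraction


# binary16: 1 sign bit, 5 exponent bits, 10 fraction bits
print(split_fields(float_to_half_bits(1.0), exp_bits=5, frac_bits=10))
# bfloat16: 1 sign bit, 8 exponent bits, 7 fraction bits
print(split_fields(float_to_bfloat16_bits(1.0), exp_bits=8, frac_bits=7))

# Same 16 bits, different trade-off: binary16 keeps more precision,
# bfloat16 keeps the float32 exponent range.
print(half_bits_to_float(float_to_half_bits(0.1)))
print(bfloat16_bits_to_float(float_to_bfloat16_bits(0.1)))
```

The practical upshot is that binary16 is more precise near 1.0 but tops out at 65504, while bfloat16 covers roughly the same range as float32 with fewer significant bits.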

bbcc90 · 2024-02-17T06:44:19 1708152259

any idea why this took them so long?

pixelpoet · 2024-02-17T07:00:00 1708153200

Because they have a near monopoly with CUDA and get away with dragging their heels due to lack of market pressure (everyone loves to lock themselves into CUDA and then complain about GPU prices), despite having been on the OpenCL committee (the same kind of conflict of interest as Apple with Metal).

Anyway, I'm very glad to see it and will be using it immediately.

dotnet00 · 2024-02-17T07:46:44 1708156004

Also because none of the competition were/are serious about OpenCL either.

AMD still doesn't have OpenCL 3.0 support and their implementation of previous versions was far far less stable than CUDA.

I can't find a definitive source on this, but afaik none of the official OpenCL implementations have ever fully supported mixed CPU-GPU code the way CUDA does.

pixelpoet · 2024-02-17T10:16:10 1708164970

AMD's OpenCL support on GPU is overall excellent in my experience (2x commercial apps and lots of hobby code), but I tend to mostly use low level OpenCL 1.1 stuff, which I find sufficient.

Also, Intel GPUs are actually incredibly competent with OpenCL if you give them a wide enough NDRange and somehow try to look past the lack of any fp64 support at all :/

pjmlp · 2024-02-17T08:38:59 1708159139

On top of that, even the "do no evil" Google never supported OpenCL on Android, pushing instead their own dialect, RenderScript.

Yes, there are some custom Android deployments that have a libopencl.so kind of thing, but it is used by the OEMs themselves and was never exposed as an official Android API.

nomel · 2024-02-17T07:47:48 1708156068

This is why I am investing in AMD. They can only improve!

my123 · 2024-02-17T17:57:34 1708192654

CUDA itself only got bumped from LLVM 5 to LLVM 7 in CUDA 11.2 (https://developer.nvidia.com/blog/boosting-productivity-and-...).

LLVM 7 opt-in for OpenCL happened some time later (available since r510). What changes now is that LLVM 7 is the new default.