> Copying Data from Host to Device Surprised there's no mention of async copies ...

nine_k · on Oct 21, 2023

Since you likely use 64-bit (double) floats, not every GPU would help much, especially compared to a beefy CPU.

But if you use a GPU with a large number of FP64 units, it may speed things up a lot. These are generally not gaming GPUs, but if you have a 4060 sitting around anyway, it has about 300 GFLOPS FP64 performance, likely more than your CPU. Modern CPUs are mighty in this regard though, able to issue many FP64 operations per clock per core.

01100011 · on Oct 22, 2023

Did you reply to the wrong comment?