Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Comparing Julia to performance portable parallel programming models for HPC [pdf]
42 points by leephillips on Nov 17, 2021 | hide | past | favorite | 6 comments


I hope LLVM gets better at using AVX-512 instructions efficiently. That looks like the main pain point found here. It's exciting to see that for the most part Julia is roughly matching performance of the mature HPC solutions.


I think the issue with code vectorization is that the compiler must know that the given loop can run out-of-order. I don't know if that is something that LLVM can do reliably.


In cases where Julia is already emitting 256 bit instructions, LLVM has already figured out that the loop isn't order dependent.

The main issue here is that LLVM thinks AVX512 is slow (which it sometimes is).


The link exposes a login and a password. Dunno if this is intended.


It's SC's open access thing, the credentials are also available on https://www.dcs.warwick.ac.uk/pmbs/pmbs/PMBS/Welcome.html


This is a honeypot for comments without clicking. ;-)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: