For starters, Apple's money is not the main driver of LLVM (in fact, going by the public contribution data, Apple is no longer the #1 contributor).
Second, "but merely the fact that compiler technology has advanced significantly in ways that GCC is not well positioned to exploit. " is simply false
In fact, that's exactly the problem for GCC: compiler technology has not really advanced at all.
GCC caught up to everyone else for the same reason.
Time for a history lesson.
About 14 years ago, a group of folks including Diego Novillo, Jeff Law, Richard Henderson, Andrew MacLeod, me, and Sebastian Pop (along with bug fixes/changes from a lot of others) sat around and built a "middle end" for GCC.
Prior to that, GCC had a frontend and a backend. The frontend was very high level (and had no real common AST between the frontends); the backend was very low level.
There was nothing in between.
We cherry-picked the state of the art in compilers and research, and built a production-quality IR and optimizer out of it.
This research has not really changed that much in about 10-15 years. Most of the research these days focuses not on straight compiler opts, but on things like serious loop transforms, helping runtimes (GPU, GC, etc.), or dynamic languages.
You can see all the tree ssa work here: https://github.com/gcc-mirror/gcc/blob/master/gcc/ChangeLog.... That log covers only until the branch was merged. At that point, it was "not a piece of crap", but this was before people added all the stuff on top of this architecture.
On top of that architecture, it took another few years to get good, and a few years after that to get really good.
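To make "middle end IR" a bit more concrete: the core idea we picked up from the research of the time is SSA form, where every variable gets exactly one assignment and control-flow joins are resolved with phi nodes. Here's a rough, hand-written sketch (in C comments) of what a tree-SSA dump of a trivial function looks like; the exact GIMPLE syntax and dump flags vary by GCC version, so treat it as illustrative rather than an exact dump.

    /* sum.c -- something like `gcc -O1 -fdump-tree-ssa sum.c` asks GCC to
       dump its SSA-form middle-end IR (flag names and output format are
       version-dependent; this is a sketch, not an exact dump).  */
    int sum_to(int n)
    {
        int total = 0;
        for (int i = 0; i < n; i++)
            total += i;
        return total;
    }

    /* Roughly what the SSA form looks like: each variable is split into
       versions assigned exactly once, and loop/branch joins pick the
       right version with PHI nodes:

         <bb 2>:  total_1 = 0;  i_2 = 0;  goto <bb 4>;

         <bb 3>:  total_3 = total_5 + i_6;
                  i_4 = i_6 + 1;

         <bb 4>:  total_5 = PHI <total_1(2), total_3(3)>;
                  i_6     = PHI <i_2(2), i_4(3)>;
                  if (i_6 < n_7) goto <bb 3>; else return total_5;

       Constant propagation, value numbering, dead code elimination and
       friends are all phrased over this single-assignment form.  */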
Bringing us to today.
LLVM was started around the same time, but had fewer contributors back then.
Essentially, you could view it as "instead of building something in between two really old parts, what could we do if we just redid it all?" People thought it was a waste of time for the most part, but Chris Lattner persevered, found a bunch of crazy people to help him over the years, and here we are.
Because you see, it turns out compiler technology has not really changed at all. So, algorithmically, LLVM and GCC implement the same optimization techniques in the middle of their compilers, because there is nothing better to do. There are just slightly different engineering tradeoffs. To put it another way: outside of loop transforms, static-language compilers targeting CPU architectures are essentially solved. We know how to do everything we want to do, and do it well. It just has to be implemented.
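As a small, hand-picked illustration of that claim (not a real compiler dump): both compilers apply the same textbook SSA-based cleanups -- constant propagation, dead code elimination, branch folding -- and converge on essentially the same result for simple code.

    /* Illustrative only: the kind of reduction both GCC and Clang/LLVM
       perform at -O2 using the same well-known techniques.  */
    int before(int x)
    {
        int a = 4;
        int b = a * 2;      /* constant propagation: b is 8          */
        int unused = x + a; /* dead code: the result is never used   */
        if (b > 0)          /* branch folding: provably always taken */
            return x + b;
        return 0;
    }

    /* Either compiler reduces this to, in effect:

         int after(int x) { return x + 8; }

       The differences that matter are engineering ones -- IR design,
       pass ordering, compile time -- not the algorithms themselves.  */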
So given enough time/effort, LLVM and GCC will produce code that is as good as each other's there. The question becomes "will they keep up with each other as engineering/tuning happens" and "who can generate great code faster".
The problem for GCC on this front is threefold:
1. The backend, despite being pretty heroic at this point, really needs a complete rewrite, but people value portability over fast code.
LLVM, having started completely from scratch, has a modern, usable backend. They are not afraid to throw stuff away.
2. For any given thing you can implement, it's a lot easier to do it in LLVM than GCC, so, given time, LLVM will produce faster code because it takes less work to make it do so than it does to make GCC do so.
3. Because it was architected differently and more modernly, clang/LLVM are significantly faster at compiling than GCC. GCC can remove most if not all of the middle-end time (and does), but it's still slow in other places, and that's really, really hard to fix without fundamental changes (see #1).
There are still plenty of open problems in compilers. For instance, writing a program to effectively use all four of my CPU cores is pretty tedious. It would be awfully nice if my compiler could automatically parallelize operations, do effective register allocation across cores, distribute data for best use of L1 cache, etc.
Certainly researchers who are working on this sort of thing today are doing it in LLVM or some custom framework. I can't imagine GCC has any significant traction at least.
The only open problem here is the one I stated: "serious loop transforms". Parallelization is not even "hard", it's just hard for languages like C++. Fortran compilers have been parallelizing for 20+ years.
For any interesting type of vectorization we want to do, the problem is not "can we figure out what to vectorize", it's "how long do we want to spend vectorizing code" :)
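To make the C++-vs-Fortran point concrete, here's a hedged sketch. The loop below is trivially parallel in principle, but with plain C pointers the compiler has to prove (or check at runtime) that the arrays don't overlap before it can split or vectorize the iterations; Fortran dummy array arguments can't alias by language rule, which is why Fortran compilers have been doing this for decades. The GCC flags shown do exist, but exact behavior depends on version and cost models.

    /* axpy.c -- an embarrassingly parallel loop.  Without `restrict`,
       out[] might overlap a[] or b[], so the compiler cannot blindly
       vectorize or parallelize; with `restrict` (or a runtime overlap
       check, which the cost model has to pay for) the obstacle is gone. */
    void axpy(int n, double alpha,
              const double *restrict a,
              const double *restrict b,
              double *restrict out)
    {
        for (int i = 0; i < n; i++)
            out[i] = alpha * a[i] + b[i];
    }

    /* Illustrative, version-dependent invocations:
         gcc -O3 -ftree-vectorize           axpy.c   (SIMD within a core)
         gcc -O3 -ftree-parallelize-loops=4 axpy.c   (split across threads)
       Deciding *what* to transform here is routine; the real question is
       how much compile time to spend proving it safe and profitable.  */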