I haven't kept up in this space, but there was a bunch of papers under "cache agnostic algorithms" that were actually taking the concept of cache into account. The cache agnostic here being regardless of cache size, sort of treating cache in the O() sense.
You could still beat them by tweaking for your hardware. But for the first time there was research into why Q sort is faster than heap sort
You could still beat them by tweaking for your hardware. But for the first time there was research into why Q sort is faster than heap sort