Wait, so the authors' method is only a tiny fraction faster than the parallel version of the more general Rao-Sandelius method, and uses more random bits? But they claim it's "very fast" and "extremely fast"? Am I missing something?
Yeah, the experimental results are pretty poorly presented, but on the other hand the abstract says: "While this algorithm is of lesser practical interest, we believe it may be of theoretical value."
The number of random bits used by both algorithms is within a few percent of the theoretically optimal value (4.6% more for Rao-Sandelius and 5.5% more for MergeShuffle), so I don't think that's a significant concern.
Does anyone know of a practical use for a very fast, parallelizable shuffle algorithm that uses few random bits? All the shuffling I've done has used small enough N that Fisher-Yates was just fine.
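For anyone who hasn't seen it, Fisher-Yates is just a single backwards pass with one uniform draw per position, which is why it's the default for small N. A minimal Python sketch (using the stdlib `random` module, not a crypto-quality source):

```python
import random

def fisher_yates(items):
    """In-place style Fisher-Yates shuffle on a copy.

    Each of the n! permutations is equally likely, assuming
    randrange() draws uniformly.
    """
    a = list(items)
    for i in range(len(a) - 1, 0, -1):
        j = random.randrange(i + 1)  # uniform index in [0, i]
        a[i], a[j] = a[j], a[i]     # swap current position with the draw
    return a
```

Note it's inherently sequential (each swap depends on the previous state), which is exactly what the parallel algorithms in the paper are trying to get around.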
Technically, the lower bound is log2(n!), which by Stirling's approximation is about n * log2(n) - 1.44n — slightly less than n * log2(n).
According to the table of experimental results, the algorithm described in this paper and two of its competitors can all do better than n * log2(n) bits.
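A quick sanity check of that bound in Python — summing log2(k) gives log2(n!) exactly, and you can see it come out below n * log2(n):

```python
import math

def shuffle_bits_lower_bound(n):
    # log2(n!) = sum of log2(k) for k = 2..n: the information-theoretic
    # minimum number of random bits to pick one of n! permutations uniformly
    return sum(math.log2(k) for k in range(2, n + 1))

n = 10_000
lb = shuffle_bits_lower_bound(n)
naive = n * math.log2(n)
# lb is strictly below naive, with the gap growing roughly like 1.44 * n
```

So "beating n * log2(n) bits" isn't beating the actual lower bound, just the bound you'd get from drawing a full log2(n)-bit index per element.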
Just in case the authors are reading the comments: the second paragraph of section 1.1 ends mid-sentence and, even worse, is missing its closing parenthesis!