Wow, this is beautiful C++ code!
High level abstractions, not hiding any of the nitty gritty details, parallelism and all, yet very readable. Well done!
I felt the same - not that worse than in a modern language (of course someone should rewrite in Go with the new algorithm so we can decide if it would be good enough :)