Loved this from the webpage intro:
"We are sorry to have to mention this point, but we have evidence that other items we have published on the Web have been appropriated and republished under other names. It is easy to detect such misuse, by the way, as you will learn in Chapter 3."
Glusterfs is an interesting take on the DFS concept and it is open source.
Also see the tutorial "Scaling Up Machine Learning" at KDD2011: http://hunch.net/~large_scale_survey/
Some interesting material in the presentations and the homeworks as well, although the bulk of the content is definitely in the textbook.