I've had much, much better results with LDA than LSI. Give it a shot if you have a chance; you'll be blown away. Stop word ratios are important, and I'd set the max number of tokens to 500,000.
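To make the vocabulary advice above concrete, here is a minimal, standard-library sketch. It assumes "stop word ratio" means the fraction of documents a token appears in; that reading is my assumption, not something the comment spells out. The function name `prune_vocabulary` and the 0.5 ratio are hypothetical; only the 500,000 cap comes from the comment. In gensim, `Dictionary.filter_extremes(no_below, no_above, keep_n)` does a similar kind of pruning.

```python
from collections import Counter


def prune_vocabulary(docs, max_doc_ratio=0.5, min_doc_count=1, max_tokens=500_000):
    """Return the set of tokens to keep before training a topic model.

    A token is dropped if it appears in more than `max_doc_ratio` of the
    documents (it behaves like a stop word) or in fewer than `min_doc_count`
    documents. The survivors are capped at the `max_tokens` most frequent.
    The default ratio is an illustrative guess, not a recommended value.
    """
    n_docs = len(docs)
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(doc))  # count each token once per document

    kept = [
        (token, df)
        for token, df in doc_freq.items()
        if df >= min_doc_count and df / n_docs <= max_doc_ratio
    ]
    # Most frequent first; ties broken alphabetically so the result is stable.
    kept.sort(key=lambda item: (-item[1], item[0]))
    return {token for token, _ in kept[:max_tokens]}


# Made-up toy corpus: "the" appears in every document and gets filtered out.
toy_docs = [
    ["the", "cat", "sat"],
    ["the", "dog", "ran"],
    ["the", "cat", "ran"],
    ["the", "bird", "flew"],
]
vocabulary = prune_vocabulary(toy_docs)
```

Whether 0.5 is the right ratio depends on the corpus, so treat it as a knob to tune rather than a setting to copy.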
I've never liked these scrolling animations. You need too much precision to see a part of the page clearly, while with normal scrolling it wouldn't matter if the information you're reading is at the bottom or top of the screen.
Articles with more traffic are bigger. I computed the semantic similarity using LSI with Python (gensim). You have to scroll down/right a bit!
http://similarityapi.appspot.com/graph/?title=blade%20runner
There is also a JSON API: http://similarityapi.appspot.com/api/v1/?limit=100&title...
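If you want to try other titles, the graph link above shows the title percent-encoded (`blade%20runner`). A small sketch of building that URL in Python follows; the helper name `graph_url` is made up for illustration, and I'm assuming the service accepts any title encoded this way. The JSON API's full query string is truncated in the post, so it isn't reconstructed here.

```python
from urllib.parse import quote

# Endpoint copied from the graph link in the post.
GRAPH_ENDPOINT = "http://similarityapi.appspot.com/graph/"


def graph_url(title):
    """Build the graph URL for an article title, percent-encoding it."""
    # safe="" also encodes "/" so a title containing a slash stays in one parameter.
    return f"{GRAPH_ENDPOINT}?title={quote(title, safe='')}"


url = graph_url("blade runner")
```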
All feedback is appreciated:
@lucamartinetti luca@luca.io
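The post doesn't include the code, so for anyone curious what "semantic similarity using LSI" boils down to, here is a rough dependency-free sketch. It is not the author's implementation and it swaps gensim for plain Python: it finds the top-k singular directions of the term-document count matrix (via subspace iteration on the document Gram matrix) and compares documents by cosine similarity in that reduced space. All function names and the toy documents are invented for illustration; with gensim you would presumably reach for `LsiModel` instead.

```python
import math
from collections import Counter


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def matvec(matrix, v):
    return [dot(row, v) for row in matrix]


def doc_gram(docs):
    """Gram matrix G = A^T A, where A is the term-document count matrix."""
    counts = [Counter(doc) for doc in docs]
    return [[sum(ci[t] * cj[t] for t in ci) for cj in counts] for ci in counts]


def top_eigenpairs(G, k, iters=200):
    """Subspace iteration for the top-k eigenpairs of a symmetric PSD matrix.

    Assumes k does not exceed the rank of G (otherwise a norm below is zero).
    """
    n = len(G)
    # Deterministic, linearly independent start vectors (rows of a Hilbert matrix).
    basis = [[1.0 / (1 + i + j) for i in range(n)] for j in range(k)]
    for _ in range(iters):
        new_basis = []
        for v in basis:
            w = matvec(G, v)
            for u in new_basis:  # Gram-Schmidt against vectors already fixed
                proj = dot(w, u)
                w = [a - proj * b for a, b in zip(w, u)]
            norm = math.sqrt(dot(w, w))
            new_basis.append([a / norm for a in w])
        basis = new_basis
    eigenvalues = [dot(v, matvec(G, v)) for v in basis]
    return eigenvalues, basis


def lsi_doc_vectors(docs, k):
    """Coordinates of each document in the k-dimensional latent space."""
    eigenvalues, basis = top_eigenpairs(doc_gram(docs), k)
    sigmas = [math.sqrt(max(lam, 0.0)) for lam in eigenvalues]  # singular values
    return [[s * v[j] for s, v in zip(sigmas, basis)] for j in range(len(docs))]


def cosine(u, v):
    denom = math.sqrt(dot(u, u) * dot(v, v))
    return dot(u, v) / denom if denom else 0.0


# Made-up toy documents: two sci-fi, two fantasy.
docs = [
    ["android", "replicant", "future"],
    ["replicant", "future", "detective"],
    ["wizard", "ring", "quest"],
    ["ring", "quest", "dragon"],
]
vectors = lsi_doc_vectors(docs, k=2)
same_genre = cosine(vectors[0], vectors[1])
cross_genre = cosine(vectors[0], vectors[2])
```

On this toy data the two sci-fi documents end up essentially on top of each other in the 2-dimensional space (cosine close to 1), while sci-fi versus fantasy comes out close to 0. On raw word counts the first pair would only score 2/3, which is the kind of smoothing LSI is meant to provide.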