I wonder if someone can use the PageRank algorithm or something similar to determine which essays are the most prominent. It's hard to tell with this graph which is the most connected essay - is it "Taste for Makers", "What You Can't Say", "Great Hackers", or something else? It would also be interesting to correlate this with the Google PageRank for these pages, and traffic data (if it's available).
Here are the ones that get the most traffic, with daily page views:
1760 Stuff
770 What You'll Wish You'd Known
403 How to Start a Startup
365 Why to Start a Startup in a Bad Economy
349 Why Nerds are Unpopular
294 How to Do what You Love
274 Web 2.0
247 Great Hackers
207 How to Make Wealth
187 Lies We Tell Kids
"Stuff" is my sending the given link to my female relatives to dissuade them from, or to reduce their practice of, being packrats. I'm pretty sure there are other users out there that do the same...
I've just pulled the description from WikiPedia, implemented pretty much that, and that's what I did.
In essence, each page gets set to 1.0. Then distribute 80% of that down each outgoing link, and 20% to everyone equally. Lather, rinse, repeat.
I think that's pretty much the algorithm. Pages pointed to by pages with lots of juice get lots of juice. Pages pointed to by no one, or only pages with little juice, get little juice. It's linear algebra thought of as network flow.
Yes, but doing what you've done here gives the entire site,
whereas implementing it from the definition allows you to
compute the actual values, as I did.
Of course, Google have twoke their algorithm so it's no
longer exactly as they originally published, and they're
not telling anyone what it actually does. Using the
"clean" version is at least transparent.