Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I wonder if someone can use the PageRank algorithm or something similar to determine which essays are the most prominent. It's hard to tell with this graph which is the most connected essay - is it "Taste for Makers", "What You Can't Say", "Great Hackers", or something else? It would also be interesting to correlate this with the Google PageRank for these pages, and traffic data (if it's available).


Here are the ones that get the most traffic, with daily page views:

     1760 Stuff
      770 What You'll Wish You'd Known
      403 How to Start a Startup
      365 Why to Start a Startup in a Bad Economy
      349 Why Nerds are Unpopular
      294 How to Do what You Love
      274 Web 2.0
      247 Great Hackers
      207 How to Make Wealth
      187 Lies We Tell Kids


"Stuff" is my sending the given link to my female relatives to dissuade them from, or to reduce their practice of, being packrats. I'm pretty sure there are other users out there that do the same...


You're right. There are rarely referring urls for it, which implies that it spreads primarily by email.



URLs are malformed. Otherwise, nice. When you say it's "similar" to PageRank on your webpage, what do you mean?


Re malformed - fixed.

I've just pulled the description from WikiPedia, implemented pretty much that, and that's what I did.

In essence, each page gets set to 1.0. Then distribute 80% of that down each outgoing link, and 20% to everyone equally. Lather, rinse, repeat.

I think that's pretty much the algorithm. Pages pointed to by pages with lots of juice get lots of juice. Pages pointed to by no one, or only pages with little juice, get little juice. It's linear algebra thought of as network flow.

Well, that's what I did.


In PageRank order: http://www.google.com/search?q=site%3awww.paulgraham.com

Google has a good PageRank implementation, though it isn't open source.


> In PageRank order: > http://www.google.com/search?q=site%3awww.paulgraham.com > Google has a good PageRank implementation, though it > isn't open source.

Yes, but doing what you've done here gives the entire site, whereas implementing it from the definition allows you to compute the actual values, as I did.

Of course, Google have twoke their algorithm so it's no longer exactly as they originally published, and they're not telling anyone what it actually does. Using the "clean" version is at least transparent.




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: