Hacker News new | past | comments | ask | show | jobs | submit login

0. Why so anti-intellectual?

1. The algorithm you're attempting to describe in 1) above is either exactly PageRank or an approximation to it, so I don't get the point you're trying to make.

2. The point of jargon and well studied algorithms is that you can recognize when your problem is a problem someone else has already tackled. In the specific case of PageRank, if you recognise it's an eigenvalue problem you can

1) Find efficient algorithms to implement it. In the case of PageRank there is a matrix factorization that can make the algorithm much faster.

2) Find convergence bounds for appoximate solutions such as the power method, so you can appropriately tradeoff compute time versus accuracy.

3) Find numerical techniques to avoid calculating a matrix full of NaNs.

4) Benefit from the decades of work in optimizing numerical linear algebra.

3. When Google was first released it was clearly significantly better than the competition (mostly AltaVista). I assume that was due to PageRank. I wouldn't be surprised if PageRank is much less important to Google now, but to get this point it had to have that initial advantage.




0. Hah, yeah. Pretty sure you’re the first person to accuse me of being anti-intellectual...I can’t say for certain, but I think this interpretation here might be on you.

1. Indeed I think you missed my point: the causality implied in your earlier post reads as “well, if you know math, PageRank pops right out!” — my point is, you don’t need to know anything about eigenvalues for PageRank to be an intuitive solution; the insight isn’t eigenvalues or even probability distributions of pages, it’s ranking by relative rank of inbound links. There are many ways to operationalize that insight, and your post made gatekeeping allusions about what math it takes to do that. That’s what I’m responding to.

2. Sure, but you were originally responding to a post about when graph algorithms might ever be useful in industry. Obviously all things being equal, knowing about graph algorithms > not knowing about them. But again, you don’t need to know about them in advance.

3. No doubt Google’s early success is due to PageRank. But PageRank’s success is due to the insight behind it — following web links backward (in fact the precursor to PageRank was called “backrub”) — and not due to eigenvalues.

To be clear, I’m not at all anti-intellectual, and I love a good math puzzle or algorithm as much as the next guy — and I said as much in my other post.

But I’m also not in love with the fetishization of formalisms that is common in technical and academic circles, and the attendant value system that is so eager to prove itself that it tries to fit any and all adjacent insights into its own paradigm — just as you’ve done here.


I wasn't using jargon for jargon's sake. The original question quoted this little passage:

"Graphs as matrices and the Laplacian of a graph. Interpretations of the largest and smallest eigenvectors/eigenvalues of the Laplacian. Spectral embeddings, and an overview of applications (e.g. graph coloring, spectral clustering.)"

My response used the same terms to quickly sketch how they relate to PageRank, and hence try to show how the theory relates to an application. If I was trying to explain PageRank to someone who wasn't familiar with the maths I would take a different approach, but this is a tiny little textbox and there are already many very nice visualizations and other explanations that one can Google.


Oh, I wasn't accusing you of using jargon for jargon's sake, and nor do I believe that you were -- you used the appropriate jargon appropriately and correctly! And certainly representing graphs as matrices and then applying matrix math to them is valuable, I am absolutely not arguing against that.

I'm just taking issue with the (probably tongue-in-cheek!) framing of your "quick sketch": rather than merely demonstrate the connection ("hey look, PageRank uses this stuff!"), the sketch implied that if one knew theory about graphs and their representation as matrices, then PageRank is pretty obvious and falls out of the math (and then you could start a US$trillions business!). I was content to just leave it at that with a snarky version of "hey, sure, this math is relevant, but it wasn't a requirement to come up with the PageRank insight" -- but then I got triggered by your "anti-intellectual" comment. :)

So now I feel compelled to say, in case any impressionable young minds read this thread: that sketch (which I likely grossly misrepresented above) is a fantasy portrayal of how the insights behind technologies like PageRank come about; matrix knowledge isn't necessary (but not unhelpful, of course) to create PageRank or similar things. I've met too many (not necessarily you!) who tell themselves some version of this story: "PageRank? That's just the largest eigenvector the edge matrix" followed by some variant of "Those guys just got lucky! / Those guys just commercialized some known math! / I do math, and math can be worth billions!"

That story ignores the actual insights required to get to PageRank, even if you already know the theory; it ignores the hard work of actually building a company and commercializing such an insight.

Anyway, thanks for coming to my therapy session. </soapbox>


> 0. Why so anti-intellectual?

I'm not the person you're responding to, and perhaps it was their tone more than their content that you're replying to, but...

I think there's a lot of value in a KISS philosophy, and using the simplest most accessible algorithms possible. I don't think that's anti-intellectual so much as minimalist. Less charitably, one could say that reaching for something more complex than is needed is navel gazing or gate keeping.

I'm far from an expert on the topic, so I'm not trying to assert if either of you is more correct than the other as far as what the more appropriate model is.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: