Google Toolbar data and the actual surfer model (glinden.blogspot.com)
7 points by prakash on July 7, 2008 | 4 comments



Mostly speculation, but they do make a good case. Current incarnations of Google's PageRank are most likely long past the "random surfer" model. Google simply has too much data on hand to resist using it to improve its performance (and its bottom line).

The problem, of course, is that a Google-killer will only ever arise from a completely different approach, not from a data-driven refinement of the current one.
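
For reference, here is a minimal sketch of the textbook "random surfer" model, plus one guess at what a data-driven refinement could look like. The `weights` argument is purely hypothetical: a map of observed click counts per link, standing in for something like toolbar data. Nothing here reflects what Google actually does.

```python
def pagerank(links, damping=0.85, iterations=50, weights=None):
    """Power-iteration PageRank over a dict {page: [outgoing pages]}.

    weights is a hypothetical {(src, dst): click_count} map (an assumption
    for illustration). When omitted, every outgoing link is equally likely,
    which is the classic random surfer.
    """
    pages = sorted(set(links) | {d for outs in links.values() for d in outs})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # With probability (1 - damping) the surfer jumps to a random page.
        new = {p: (1.0 - damping) / n for p in pages}
        for src in pages:
            outs = links.get(src, [])
            if not outs:
                # Dangling page: spread its rank uniformly over all pages.
                for p in pages:
                    new[p] += damping * rank[src] / n
                continue
            if weights is None:
                w = [1.0] * len(outs)
            else:
                w = [float(weights.get((src, d), 0.0)) for d in outs]
            total = sum(w)
            if total == 0:
                # No click data for this page: fall back to uniform links.
                w, total = [1.0] * len(outs), float(len(outs))
            for d, wd in zip(outs, w):
                new[d] += damping * rank[src] * wd / total
        rank = new
    return rank


links = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
uniform = pagerank(links)
clicked = pagerank(links, weights={("a", "b"): 9, ("a", "c"): 1})
```

In this toy graph, telling the model that surfers on "a" mostly click through to "b" raises b's rank relative to the uniform case. That is the kind of shift usage data could produce without changing the basic approach.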


They use all the data they get from every source they can get and they link it all together any way they can. I have no inside knowledge, but that seems pretty clear cut. What's to speculate?


What's to speculate? Plenty.

Are they still using PageRank or something completely different?

If they modified it, how did they do it?

Do they really use all available data? Or does the "don't be evil" mantra prevent them from using specific datasets?

What is their current "surfer model"? How random is it?

Until we have answers to these and other questions, I would say there is plenty left to speculate about.


Do they really use all available data? Or does the "don't be evil" mantra prevent them from using specific datasets?

I suspect Google believes using more data is always "good", so it's impossible for "don't be evil" to mean "don't use data we legally collected". Does not compute.



