What makes you love Google is Google giving you the right answer.
If you live in the US, do a google search for dmv.
Did it pull the one from your state?
Do you really want an averaged across all searchers answer? California is probably the state the generates the most Google searches for dmv, but the California DMV is likely relevant iff you are in California.
This idea of one best search page for one query, independent of any other factors based on the user or users geography is really overly romantic and simplicistic.
On the other hand, the filter bubble stuff is a nice narrative, but it mostly makes for an interesting story rather than real problem. Having worked on search personalization at Google about 5 years ago, it's just hard to make that big impact on search results with personalization. "dmv" is a nice example where it's works brilliantly, but those ambiguous queries are pretty rare, and hence the more bread and butter stuff is still really driving most of the quality.
First of all, no, it doesn't give me the DMV for my state first.
And second, why would I do that when I can easily search for "$statename dmv" and have that search universally do the right thing regardless of location? I don't want hidden inputs to my search based on my location; I want everything that determines the results encapsulated in the query.
In the end, it might be optimal for the world to have multiple, specialized search engines/apps to please as many wants as possible.
Hackers prefer the latter. Everyone else? That's up in the air.
I might search for pizza in order to find a place to order it from. Or I might become curious, and search for pizza to learn about its origins and history. A personalized search would only answer the first option.
That becomes even more frustrating when you (temporarily) are in another country and suddenly get smacked in your face by incorrect location based search results. If google wants to really retain traction with people who want to search for what they type (don't even get me started on the "showing results for: $bshere, if you want to see results for what you searched for, click here"), then they will soon have to provide a switch for it.
I really hope that google goes that way but I think it only crop up when people working at google find themselves using a different search engine to find (esp. programming related) specific topics.
For the overwhelming majority of my typos, "$bshere" is exactly what I wanted to search for. I particularly appreciate this when I'm searching from my phone (where typos are more common). There have been times when it has gone the other way (searching for "WiDi" aka Intel Wireless Display is the most notable example I can remember), but those are rare enough (and understandable enough) that they don't bother me.
> we found that users typed the “+” operator in less than half a percent of all searches, and two thirds of the time, it was used incorrectly.
Check my arithmetic but that means only 1 in 600 searches used + correctly; 2 in 600 used it wrong, and the other 597 searches didn't use it at all.
0.5% == 3 in 600
2/3 == 66% = 2 in 3 are wrong
No it's not. What I loved about Google is giving me the ability to search the web. If my answer wasn't on the first page (or even the second), I'd refine my query until I would get it. Which was easy because Google was very deterministic, just your basic "AND" keyword search engine. Try a few synonyms maybe, different angle, synecdochical approach and if all that didn't get you your answer, it probably wasn't there.
Today, you type in your query, Google tries to guess what you need, and shows you your results. If your answer is not there, you can try different queries, synonyms, whatever, but you'll get mostly the same results again. The web being unimaginably bigger than 10 years ago when I used the approach described above, I refuse to believe my answer's not there. Google is just being dumb.
Anyone know of a proper Search Engine? (I love Duck Duck Go but its index is rather small)
actually, you were probably not trying hard enough. Pressing on over the course of a week and getting some expert friends to help you would always show that your answer was in fact there. in some form or another.
Really it's hard to say because 9 times out of 10 it works fine. Probably even more. Because most of my queries are simple ones. I'd have to go dig through my browsing history, it'd take a lot of effort. I did it once, when Matt Cutts asked a similar question, but right now it's rather late here so I hope you can excuse me :)
One thing I can say, I have tried to diversify my usage of Search Engines in the past, to not just use Google but also Yahoo, Bing, and others. Even changed my default search engine sometimes. But it never stuck. Google just was too good--its speed was a big factor btw, even when the results weren't great, the fact that you could quickly try different queries made up for it. But since a couple of months I find myself more and more going for DDG at the first query--also because if DDG doesn't have my answer it has a convenient link to Google to retry, if Google would have the same I might even still use Google with DDG as a fallback instead :)
Yeah I'll investigate that tomorrow.
More generally, I think users appreciate when recommendations are transparent. Amazon and netflix are great about this - users get to implicitly choose when they want "recommended for you" or just basic results. I'd like Google search more if it had the same model.
A Google search for "florida dmv" returned Florida-based results.
What I am tired of seeing are Eames chair ads all over the place. Do one search for Eames chairs, and click on one paid result, and now all I see are Eames chair ads. Sigh.
At least it's a break from the casino school ads I was getting.
Then when you visit other google properties, google automatically gives you ads from that Eames chair site.
So you click on the relevant link ("happened" to be an ad,) as the Stanford MBAs high five each other at Googleplex. I remember reading a very interesting comment about the incentives of providing the best results and ad revenue. It's clear to me that ad revenue has been winning at Google.
I work on Search Quality at Google. We have a firewall between Search Quality and Ads. We don't answer to Ads. Revenue is just not my team's problem. We come into work every day and look at our search results and try to get the most relevant answers ranked as #1, #2, etc. I never think about the impact of my ranking improvements on ads. I've been personally involved in dozens of changes, and I've never once heard anyone argue for or against releasing a change because of ads.
I can understand why someone might be mistrustful because of the apparent incentive to sandbag the algorithmic results. But Google has the long-term view: if we can keep satisfying our users' needs, the revenue will follow.
But Google has the long-term view: if we can keep satisfying our users' needs, the revenue will follow.
Personally I no longer buy that. The revenue is now #1 and now Google is spending like crazy to keep the users. It started last year full speed
Then New York.
The only results for Texas DMV are two ads at the top of the result listing for sketchy drivers-license.org type "businesses".
I've been in TX 5 years now (and owned a car that whole time) and never even knew we had a department of motor vehicles.
Why is this getting down-voted? Is it because you disagree with what this person is saying? I thought down-votes were only for things that didn't contribute to the conversation.
It feels like this is getting down-voted because people don't like that Google is doing these things. So, this person is getting down-voted 'cuz of Google? Huh?
I'd hope a majority of users here feel the same way and act accordingly - pg's comments not withstanding.
There's a dangerous consequence to saying this. Here's how:
1. Voting is easy.
2. Voting is valueable: it literally changes HN for everyone else.
3. Agreement/disagreement is easy: we're taught how to do it as children.
4. Agreement/disagreement is value-less. Saying that you agree with pg on 'Hackers and Painters' is valueless. Saying "prior to reading H&P, I was in mindset A-B-C supported by points of view D-E-F, and here's how they were modified, and here's the new mindset H-I-J" conveys the same agreement, but it provides value.
We shouldn't incite people to use up/down votes to express something value-less. Agreement and lack thereof (alone) is valueless.
yeah, well.. you need to see my posts on Apple threads.
My state has a Motor Vehicle Division.
"DMV" gives me California's.
This page which does not appear on Google's first page: http://en.wikipedia.org/wiki/DMV - though perhaps it should.
> it's just hard to make that big impact on search results with personalization
Trying to make a big impact on something that was more or less working might be the problem. A "don't fix what's not (too) broken" could be a good idea.
For your dmv example, actually, I din't know what dmv is, and my fisrt page (french localized) is about a medicament dictionnary for doctors (DMV, didn't know it either).
"dmv usa" doesn't get me the dmvusa.org result firt either (it's the one I get on the US google)
This seems to be the best example of why geospecific data can be widly irrelevant, and using the default setting, I get this kind of unusable results for half of what I search.
But you explained the core of the problem.
There are two kinds of searches:
1) discovery of information
2) finding information I probably know but I'm just too lazy to go that site directly (or something in that sense)
The first kind of search is what made Google the most popular search engine. The second kind of search is something new for Google and it seems it is implemented as a unsophisticated simplification of the complex problem. It is like AOL search all over again.
In fact, I find myself spending less and less time on "Google the Search Engine" because it's gotten so damn good at giving me the right result right away based on the simplest and vaguest keywords (I use the omnibar so I don't even go to google.com anymore). I still remember not so long ago having to try different phrasings of my search keywords a number of times before I would get the result I was looking for on the main page (or in cases of extreme frustration clicking through pages and pages of search results). This is largely not the case anymore.
From my general interactions with people and how they use the web, anecdotal as it may be, they simply doesn't give a shit about ads, filter bubbles and so on. Google Search hasn't changed for them in any meaningful way except that it's gotten a lot better at figuring out what they wanted to find (and in most cases they are not even conscious of that improvement over time).
With regards to other other free G-services (Gmail, Docs, G+, Blogger, YouTube), if you are consciously buying into them, it is implied that you know you are entering into a relationship where parts of the real-estate you're interacting with might be used to display ads and I think everyone understands and accepts that.
I am an avid conspiracy theorist, however, and have more long-term concerns about what happens if Google turns TRULY evil and starts to use its data to spy on citizens on behalf of some authoritarian government. But as of right now I feel a little foolish reacting to this concern in any significant way.
I should stress that like any software developer (or any power user, probably), my queries tend to be very specific. I'm looking for particular information (which may or may not exist) that takes more than a couple of words to describe. It is with these queries that all too often Google simply decides that I meant something else and replaces one or two words with what it has decided are synonyms. Except they aren't.
Unfortunately I have not documented all these queries (I should have). But to give some examples, it would not be the first time that Google misinterprets a specific query about "sqlite" and only returns results related to SQL Server. Or it decides that a "scapegoat tree" is equivalent to an "AVL tree". They aren't synonyms. The results are actually worse than irrelevant.
It's like typing a query such as "honda brake problems" and getting results that are about "toyota brake problems". Yes, both are Japanese car manufacturers, but they are really not equivalent.
Of course correcting typing errors and searching for synonyms is useful. But I'd like to have control over that behaviour, and most of all: I want to understand what the search engine does. The best interfaces are those where a user can develop a mental model that helps them interact with it. It seems I'm no longer able to create a mental model for how Google works.
So who do you think should invest the extra effort into performing searches (aka adding quotations.. or "taking them off"): the hackers with specific, but edge-case search requirements, or the 99% of casual users who just want to learn about "how do helycopters fly"
P.S. I tried both the sqlite and the scapegoat tree searches without the quotes and Google seemed to "know" what I meant... but if you encountered that problem perhaps it's an algorithm that unnecessarily "kicks in" depending on some other circumstances of the search. I would definitely classify this as a problem, but one that should again be easily solved with the use of quotation marks
By the way, the scapegoat tree search query was "scapegoat tree inorder append". Try that and observe 90% AVL tree results on the first page.
1) Google focuses more on commercial sites, especially branded sites, than research or personal sites.
2) Sturgeon's law (90% of everything is garbage) and a much bigger world wide web means that there's a lot more garbage indexed, and returned, by Google searches.
I remember that I'd try a search phrase, and I'd be taken to some Web 1.2 site with black text on a grey background; some university professor or doctor would have written a really interesting article and linked to other sites or research. Now I get some reputable site selling stuff, or a bunch of shitty link farms offering me god-awful "reviews", or awful awful blogs.
I remember when search terms would return less than a gajillion hits; you could tell by the extra 'o' in Goooogle how many hits you'd got. When was the last time GoogleWhacking was possible? A result of "zero hits" is important. It doesn't mean the search failed and that Google should feel free to substitute words and try different stemming and remove the quotes. "Zero hits" information for me.
I strongly agree that I'm an edge case, and that Google is great for most people.
1. I already know what I'm looking for: a particular website, post, comment, the lyrics of a song that I know a part of.
2. I have a question, I don't know the answer, but I'll recognize it when I see it.
In my experience Google is getting better at type 1 '(re)discovery/popularity' search and worse at type 2 'exploratory/specificity' search. Depending on the ratio of type 1 vs type 2 in your search habits, Google results will get better/worse for you.
They introduced Verbatim search to solve the specifity problem, but in my experience it hardly ever works :(
I don't want to know what the "average" likes to read, or even what people like me like to read. I want to know what smart, interesting people (not me!) like to read.
In other words: I want curation by an expert, not the average of everyone's inputs. Google is supposed to know the right answer, not ask all my friends what the answer is and report back.
The thread recently where Jordan Mendelson (napster hacker) chimed regarding Napster's back end is the quintessential example of what I'm talking about.
I think Google is facing a SELF-DEFEATING reality that as they serve people with better results, they erode the perception of their brand in peoples eyes.
There is clearly an incongruity between what people "THINK" they want from a search engine( and how they perceive Google) versus, what they "REALLY WANT" from a search engine.
Conflicting priorities are always a struggle, but this is an especially interesting one. One side has a DATA supported reality that can't be disputed about what searchers want and what puts bread on the table.
On the other side is the more important, but less quantifiable reality that their BRAND is what brings searches back to the site, not the quality of the results, but the perception of GOOGLE and SEARCH being synonymous. This brand value is clearly getting diluted by delivering better results, and could in the long run be their demise.
I think the unfortunate endgame here is legislation that will require what people think they want (e.g. privacy, absolute control) at the expense of some things they actually want (e.g. frictionless social interaction, simplicity).
(To forestall some argument: Yes, I realize that privacy/social interaction and control/simplicity are not always mutually exclusive. However, there are cases where they interfere with each other, which requires compromise.)
It's a mediocre service now. Became big with search, became worthless for searches for me. I cannot search anymore, I can play games with software that tries to second guess my intentions.
Google is assuming that when people search, they want answers, but maybe what they really want is to SEARCH for answers.
However, because they are essentially a monopoly, many people try to game the system for their benefit. This creates an arms-race, and may be one of the root causes of this personalisation. I do agree with the main point, that this could potentially make Google less useful for us all as a whole, and will probably lead to their disruption, sooner or later.
This is a pop-science article, so I don't have high expectations.
Nonetheless, my criticism was directed at this article for not providing at least a few examples of the results of a filter bubble.
Serving personalized results to users may increase click-through rates but it degrades the idea we form of what Google is and what it does.
If it serves me better results, then how so?
A search engine used to be a tool for reaching out into the world and finding something new. Now it searches through what I already know? What's the point in that?
I always thought that search personalization was of course better, until that page made me think about the downsides.
Google makes its money from advertising, but in order for it to place ads it needs to serve relevant results; serving "tailored" results because they generate more clicks is short-sighted (well, that's the argument anyway). If people become so distrustful of Google that they stop using it, and use a combination of DDG or Wolfram Alpha for example, then there will be no more ads for Google to serve.
We're admittedly very far from this, but the fact that high-profile people such as PG start to complain about it publicly (and not just old anonymous grumps like me) should make Google think.
or Google could serve ads by developing a suite of web apps, or a mobile operating system, or a even a social networking site.
Google could even serve ads on other people's websites when people visit those websites without going through their search service.
Anyone know what proportion of their revenue comes from ad placement on their own product pages?
>Are you aware that Google runs an entire ad network?
Which would mean that they would want to keep such third party sites interesting, fast and high quality as well. Chrome, Analytics and their other similar products/services are hence justified.
Now, I feel like I'm living in the law of averages. I'm not interested in the average; I want the top of the scale. I really do not want social and local and all the other unknown contexts involved in giving me what's right "for me". With all the contextual crap, I feel like now I get the "less".
I wish they let me have the option of turning it all off, and just going with content-based search. Of course, they have to filter out the crap that's being generated to game their search index, but outside of that -- give me what the web has.
There is no alternative. Maybe DDG but results are not so good.
The problem of giving people what they want is also the problem of markets and democracy in general. Fortunately for us buyers/citizens, we usually eventually realize what we really want - unfortunately for the vendor/politician catering to what we previously thought we wanted.
There's an appeal to Jobs' approach: make what they will want once they've see it. Far superior to creating/governing by numbers.
This practice is being pushed _hard_ to all businesses here as a central theme of growth of the internet. In the sense that this drives better business, it certainly will be. Don't expect to go to Toyota.com and see the same experience as your neighbor anymore.
The only somewhat "break through" could be seen in local searches - and saving people from adding their zip code, city or street name to their search phrase seems to be not that much impressive either.
There is already an established term for this and it’s not “filter bubble” or “tailoring” the term is “relevance” and it’s the core of search.
If the author is suggesting that data about a user’s identity, history, and location isn't usefully to providing her with the best possible results then he should stick to writing about stuff he actually comprehends and not be the umpteenth author resorting to use “Google” in the title to whore traffic.
I find this so utterly naive that I am incredulous.
Google is a business. They don't tweek results based in favor of user preferences.
They tweek results based on the amount of revenue they are likely to bring in.
It's not an accident that [in the US] http://www.google.com/search?q=weather provides top links to commercial advertizing driven sites for one's local weather and only portals into the main page of weather.gov rather than the local forecast page.
Google search results aren't the best possible for the most people. They are the worst tolerable for maximizing revenue.
 do employees have to use 20% time for downvoting on HN or is it part of their job?
He complains they're not the same, that if you search from a given location or being in a given demographic you will get different results than other people, and that this fact erodes the trust users have in Google: how do I know if this set of results is the absolute best on a given topic or if Google is simply showing to me what it "thinks" I will like?
It is showing you the most amount of pertinent advertising that it thinks you will tolerate while promoting sites containing similar advertising as high up the page rank as it thinks you will tolerate.
All those links to your G+ friends are promoted to keep you on Google's properties where they can make the most money when you click on an advertising link - think of Google like a DisneyWorld's Magic Kingdom and G+ like Downtown Disney.
This is entirely false, and I wish misinformed people would stop parroting it. There's an extremely rigid firewall between the ad side of the company and the search side. People working on search don't even see metrics on how their changes effect ads, let alone make decisions based on them.
When I want change the way ranking works, there's a big stack of statistics I have to collect. None of them involve ads. All of them involve how useful the results are to a user.
The Weather channel utilizes Google search.
The Weather Underground...well name a Google service they could use and don't.
Accuweather provides plausible deniablity.
On the other hand, weather.gov employs some of the best meteorologists anywhere, provides the official hazardous weather warnings the other sites rely on, and has a wonderfully informative hourly weather graph for any given location.
In other words, Google doesn't provide the most relevant results first when it comes to local weather in the US, it provides the most profitable.
But I'm willing to read your statistical explanation for why this is so. Go ahead and make your case.
Be sure to explain how the special format for the weather page is based on statistical data and not hard coded.
I can't get into detail explaining what goes into our rankings to you. I can tell you that I can bring up a debug view that shows all the signals that enter in to this ranking, and none of them involve "uses google products."
We love for people to complain about queries. That feedback is very useful, and we always need more of it. But when our search results suck, the reason is that our system is too stupid, not that it's perversely smart at funneling you into money making opportunities.
The "weather" page is hand coded - the term triggers a unique piece of code that isn't used for any other search term.
"Pickles" doesn't draw a bunch of icons across the top of the search results and offer short form links to "Mt Olive," "delmonte" and "vlasic." The top link doesn't give me subdomains for MtOlivepickles.com in the way Google search does for weather.com.
When I query "weather," google sends my location to the commercial sites, but not to weather.gov. Explain that with statistics.
 At the risk of further cynicism, I'll explain it with the relationship between WeatherUnderground.com and Google: http://en.wikipedia.org/wiki/Weather_Underground_%28weather_...
Then don't think of it as stupidity then. Think of it as having poor proxies for quality that we have to mash together into a reasonable but imperfect system.
Here's a video that shows how all decisions about ranking changes get made: http://www.youtube.com/watch?v=JtRJXnXgE-A I've been in that room many times over the years, and I have never heard anyone mention or display any statistics that involve ads or revenue.
I think of it as good proxies for revenue. It just makes more business sense, and seems more in line with the fiduciary responsibilities of Google's board in regard to Google's stock holders.
And that's why I don't doubt your statements that the algorithm doesn't do this or that. There are proxies. If Billy Beene could do it a decade ago in Oakland, it is hard to see the smart folks in Mountain View depending on dumb luck to generate revenue from search. And that's the only alternative to tweeking the algorithm to generate revenue.
"q=weather" shows an observable point where hand coding for business interests is accessible and the that holes in the wall are deliberately created. It further shows a correlation between the hole and revenue - note that weatherunderground.com generates revenue outside the ad model.
It's difficult to believe that this practice isn't automated.
Look, I've worked in search at Google for over 5 years. I know all of the signals that determine how results are ranked and most of the details about how they are used. I know all of the metrics that are used to evaluate and tune ranking. I know nearly everyone who works in ranking, and I've seen a decent fraction of the launch decisions and the metrics supporting them.
I'm telling you, with no caveats, that we don't make ranking decisions based on statistics related to revenue. If that doesn't convince you, then I don't know what would.
I think of it as additional information Google supplies, I don't think of it as part of the search results.
I liked the old Google better. Today DDG has the simpler interface and gives me 50% more search results to choose from. Google looks more like the old Yahoo or Netscape portal every day.