One example is an end-grain cutting board that I made recently. For most things Ike that, the top 1000 are dominated by made-for-Pinterest blogs or major sites that aggregate low quality content that's good enough to get hits but not good for much more than that.
Similar extensions exist for firefox.
For example searching for how to grow a garden. I never want to see howtogrowagarden.com. I’d prefer to find more genuine l, non seo juiced advice.
Searching for least-relevant can be pretty random: it's easy to point at the center of a circle, but the edge of the circle is not a point
and since SEO is abused for pageviews i.e. apparent relevancy, this penalizes SEO abuse.
The venn diagram between SEO and user satisfaction is gradually being compressed into a circle by Google as they improve their algorithm.
SEO is already basically human oriented now- anyone selling mumbo jumbo SEO magic now is a crank. It used to actually work quite well.
I’d like to remove just the top 10K or 100K.
> buy hoes -sex -porn
I still have difficulty finding information I need for work from company Intranets
I still have difficulty finding really local information
I still have trouble finding news that is objective and not slanted or click bait
I still have difficulty finding recommendations on finding recommendations for good books to read
It seems to be like Google has really dropped the ball on search since they acquired "lock-in" through Gmail, Chrome sync, Android etc.
For the others SEO optimizations are a real problem, yes. You can only try alternatives, like searx.to, bing or asking around.
Elasticsearch uses Lucene under the hood, in my experience Lucene dominates the actual indexing and searching, although I'm not familiar with xapian.
By users, I reckon SharePoint search (FQL) is probably the biggest although it is way behind Lucene in features.
Google actually used to sell bright yellow branded racks that they would come and install on corporate networks to provide a "private Google" but I'm not sure if they still do.
I wouldn't trust Google locally neither, and it's expensive.
SharePoint search is unfortunately used too often, yes.
I believe you, I just can't find evidence online.
My application is not that big, around 10 million text files, but I would be interested in anything faster (or allowing more complex queries) than Lucene, which is what I use at the moment.
Where I see this being useful is searching for current events. For example a search for a local double-shooting I've been following returned some information I hadn't seen before. It would probably be good for them to focus on more news-oriented searching as that's where there's a serious echo chamber among the top websites.
But for me at least, a couple of rules are enough to solve 99% of the problems.
EDIT: You can actually achieve something similar with bookmarks, using keywords and '%s'
The idea is: take a link to a search query string of a search engine, and replace the query part with '%s'. For example, take the following search query on DuckDuckGo:
cute hedgehog -site:www.pinterest.com -site:boredpanda.com -site:amazon.com -site:etsy.com
ddg cute hedgehog
Look for an extension that highlights the good results. I find that more valuable than filtering the bad results.
As for Google, the limit to a query is 32 words, apparently:
... however, it also supports inurl:<query>, so you can easily filter out sites with manu subdomains (say, pinterest.com, pinterest.co.uk, etcetera), just by using -inurl:pinterest
Also, adding a Dark Theme to the settings would be nice! (I'm trying to minimise the amount of light I'm exposed to at night to reduce eye-strain)
Love the concept, seems to work well. I personally use the 'media' specification of Google often (mostly: images, videos, pdfs, scholar articles, and Google patents). I didn't see a way to filter on Million Short.
...and if you could find a way to filter content that is not behind paywalls that would be amazing. As an engineer I'm constantly searching standards/specs (e.g. IEEE, ASTM, ISO standards etc.) some of which you can find free copies of for the previous revision (which are still pretty good), but you have to dig deep through top-ranking paywall sites.
If it takes off, I wonder if the phenomenon will give a boost to affiliate sites. If you can't be there in the top results, align with those who can.
The article title is both, so we replaced it with representative language from the text.