Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Ask HN: Better Search For Wiki?
8 points by avner on Jan 3, 2009 | hide | past | favorite | 6 comments
I am looking for a better search solution for a wiki-based knowledgebase I manage.

In a nutshell, I am looking for something along the lines of google custom search but I need to be able to restrict access to the search results to my users only (kerebros).

Any suggestions?




Solr (http://lucene.apache.org/solr/) is a web search engine built on top of Lucene. It should get you most of the way there.


For free or for pay?

A Google Mini is about $3K (http://www.google.com/enterprise/mini/index.html). That's about as close to a private google custom search that you'll find. I used to admin one, I believe you could integrate it's authentication into LDAP or Active Directory.


The place where I work actually own extensive google search appliance but to get through inter-departmental red tape will be harrowing- not to mention I won't be able to admin it.

Looking for something free- ^^ Solr looks promising.


I run a few mediawiki's, and so far the search that comes with it has been adaquate.

For other sites I have experimented with CLucene, htdig, and maybe some others. A decent search can be put together in the course of a couple of days of careful attention.

Do you think that your current problem could be the basis of a paid-for service ? I.e., if you could pay a monthly or per-use fee, give credentials to a service that would then access the private data you wanted indexed, and then provide the search functionality in a way that only your server could fetch it (and then re-display it to the authenticated user), would you buy it ?

I have also thought about using open source software to make a search appliance similar to google's, and selling them or selling an installer CD that would configure an old PC to be one.


if you are using MySQL or Postgres

Sphinx Search Engine http://www.sphinxsearch.com/

it searches the database where you stored your documents. It doesn't crawl your website.

The quality of the search results is superior.


Sphinx is definitely easier to get going than Solr, but tends to be slower and doesn't scale as easily as Solr(Search on Lucene with Replication).

But these two are the best options, just depends on your needs/trade offs.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: