Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I'm not overly familiar with search as a field - why is stemming "critical to FTS" rather than a nice enhancement? I would have thought the defining criterion for "full text search" was that it searched the full text of the database?


The key is "with support for many languages". For many languages (other than English) with heavy use of suffixes and/or prefixes, FTS without stemming is kind of useless.


For many languages, it is even useless with stemming, you would want lemmatization plus a certain amount of fuzzy matching.


I didn’t quite understand what you meant. I saw your comma and thought you were developing another sentence and thought entirely. I understood you meant FTS search needs stemming to be useful after skipping he comma. Just FYI.


Yes, you understood correctly. English is not my native, sorry.


I used to be a heavy comma user. Still am. But try being ‘radical’ and not using a comma more often. (It’s not actually radical at all)


you don't need stemming for full text search, although it is a benefit for some text data.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: