Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Introduction to the MinHash algorithm (tonicebrian.com)
5 points by archie on March 11, 2013 | hide | past | favorite | 1 comment


The introduction would have been better on a concrete example.

This algorithm is often used in a search engine to detect near-duplicate web-pages and to remove them from search results.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: