Which underlying algorithm? The search engine was built on a ternary tree with a...

Mizza · on March 17, 2012

Wow, that's incredibly fascinating! Thanks for posting such a great reply! You built something that really changed the world, in my opinion, and you should be very proud.

I'm particularly pleased by the use of ASN/distancing weights in your results! None of the BT trackers I've hacked around on have had anything like that in them.

Where is the code now? It should be in a museum.

Aloisius · on March 17, 2012

No problem! It occurred to me that I've never really talked about the Napster architecture or algorithms publicly.

I once hacked up Transmission to re-prioritize based on network distance. It worked rather well when it discovered over the DHT. It is a lot easier when you only have to calculate distance between you and other networks. Storing the graph of distances between two arbitrary users is harder.

The code was part of the assets that were bought out of bankruptcy by Roxio (who renamed themselves Napster). I doubt it is being used for anything.

tuna · on March 17, 2012

you should definitely publish this kind of things somewhere. it was big way before the whole big data stuff around.

cheers.

prawn · on March 18, 2012

Loved this:

"...it was completely unnecessary to use a stemming algorithm because if you typed in a misspelled search, given the size of our index, there was probably someone who tagged their file using the same misspelling."