How Shazam Works (2003) [pdf] (ismir.net)
117 points by goranmoomin 11 months ago | 29 comments



Merlin (Bird ID) should rebrand to Shazam for Birds or something to that effect, because that's what it is. It always blows people's minds, and they always say something to that effect about the name.

It runs locally, so make sure you install and download over wifi (the bird models for the West Coast were about 800 MB last I checked).

https://merlin.allaboutbirds.org/


I love how this works completely offline. Could this blast from the past idea be the future of software please?


Merlin should obviously be called "XKCD 1425"[1] rather than "Shazam for Birds" if they're intent on renaming to something less distinctive

[1]: https://xkcd.com/1425/


Tweeter


It feels even more impressive to me because it's always used outside and picks up multiple birds easily. Our current record is 14, and we live near a busy road.


Such impressive engineering behind it. I remember using Shazam for the first time and thinking it was an absolute game changer. I still use it through Siri to identify songs I hear out in the wild.

Didn't realise it existed all the way back in 2003, which makes it all the more impressive.


Right? That it worked even with pretty terrible GSM encoding at the time is pretty dang magic to my mind even after reading these slides.


One of the few apps that still feels like magic.


It was originally designed to be called from your landline or flip phone, to concretize the experience of the time.


It was like a magic tool for me back in the day.


2002 even!


So fundamentally just a ton of locality sensitive hashing, cleverly chosen to match the domain. Still wild that they shipped this in the flipphone era.
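For anyone who wants to see the shape of it, here is a minimal Python sketch of the peak-pair hashing idea as I understand it from the paper: take spectrogram peaks, pair each anchor peak with a few later peaks, and hash (f1, f2, dt), keeping the anchor time next to the hash. The function name, `fan_out`, `max_dt`, and the truncated SHA-1 key are my own illustrative choices, not anything from Shazam's code, and peak picking is assumed to have happened already.

```python
import hashlib
from typing import List, Tuple

Peak = Tuple[int, int]  # (time_frame, freq_bin); assumed output of a peak picker


def pair_hashes(peaks: List[Peak], fan_out: int = 5, max_dt: int = 200) -> List[Tuple[str, int]]:
    """Pair each anchor peak with up to `fan_out` later peaks.

    Returns (hash_key, anchor_time) tuples. The key depends only on the two
    frequency bins and the time gap, never on absolute time.
    """
    peaks = sorted(peaks)  # sort by time, then frequency
    out = []
    for i, (t1, f1) in enumerate(peaks):
        paired = 0
        for t2, f2 in peaks[i + 1:]:
            dt = t2 - t1
            if dt <= 0:
                continue  # same frame: not a useful pair
            if dt > max_dt:
                break  # peaks are time-sorted, so nothing later qualifies
            key = hashlib.sha1(f"{f1}|{f2}|{dt}".encode()).hexdigest()[:10]
            out.append((key, t1))
            paired += 1
            if paired >= fan_out:
                break
    return out


if __name__ == "__main__":
    demo = [(0, 10), (3, 40), (7, 25), (12, 10)]
    for key, t in pair_hashes(demo):
        print(key, t)
```

Because the key only depends on the two frequencies and the time gap, shifting the whole clip in time leaves the keys unchanged and only moves the stored anchor times.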


Incredible that this was done in 2003.

There's a python library called dejavu[0] that does similar hashing. It mostly works to make your own Shazam (assuming you can source the data to hash!)

[0] https://github.com/worldveil/dejavu



The only issue I've had when playing around with Dejavu is that query speed slows linearly as the library size increases.


Yup, agreed. Currently at 4 billion hashes and matching takes 30-120s. I haven't put effort into speeding it up yet because I'm using it for a trivia contest and we have a couple hours to figure out the clips.
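For context on what the matching step has to do, here is a rough sketch of the lookup as the paper describes it: an inverted index from hash to (track, time), then a vote on the time offset between track and query. This is not dejavu's code or schema; `build_index`, `best_match`, and the toy hash strings are made up for illustration, and a real library would keep the index in a database rather than a dict.

```python
from collections import Counter, defaultdict


def build_index(tracks):
    """tracks: {track_id: [(hash_key, anchor_time), ...]}
    Returns {hash_key: [(track_id, anchor_time), ...]}."""
    index = defaultdict(list)
    for track_id, hashes in tracks.items():
        for key, t in hashes:
            index[key].append((track_id, t))
    return index


def best_match(index, query_hashes):
    """Vote on (track_id, time offset). A true match piles up many votes
    on a single offset; chance collisions scatter across offsets."""
    votes = Counter()
    for key, t_query in query_hashes:
        for track_id, t_track in index.get(key, ()):
            votes[(track_id, t_track - t_query)] += 1
    if not votes:
        return None
    (track_id, offset), count = votes.most_common(1)[0]
    return track_id, offset, count


if __name__ == "__main__":
    tracks = {
        "song_a": [("h1", 0), ("h2", 5), ("h3", 9), ("h4", 14)],
        "song_b": [("h2", 1), ("h5", 4), ("h3", 30)],
    }
    index = build_index(tracks)
    print(best_match(index, [("h2", 0), ("h3", 4), ("h4", 9)]))
```

In this form the cost per query hash is the length of its bucket, so one plausible (unverified) reason for slowdown at scale is buckets growing as more tracks share the same hash.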


Related: https://news.ycombinator.com/item?id=32530056 "Source code example of the Shazam algorithm" et al.


Related. Others?

How does Shazam work? (2022) - https://news.ycombinator.com/item?id=38531428 - Dec 2023 (154 comments)

An Industrial-Strength Audio Search Algorithm (2003) [pdf] - https://news.ycombinator.com/item?id=33299853 - Oct 2022 (1 comment)

Creating Shazam in Java (2010) - https://news.ycombinator.com/item?id=32530056 - Aug 2022 (36 comments)

Shazam turns 20 - https://news.ycombinator.com/item?id=32520593 - Aug 2022 (227 comments)

How Shazam Works (2015) - https://news.ycombinator.com/item?id=23806142 - July 2020 (7 comments)

Designing an audio adblocker - https://news.ycombinator.com/item?id=18855029 - Jan 2019 (186 comments)

Show HN: A radio/podcast adblocker featuring ML and Shazam-like fingerprinting - https://news.ycombinator.com/item?id=18459058 - Nov 2018 (2 comments)

Apple has completed its acquisition of Shazam - https://news.ycombinator.com/item?id=18066724 - Sept 2018 (316 comments)

Apple Buys Shazam to Boost Apple Music - https://news.ycombinator.com/item?id=15899065 - Dec 2017 (156 comments)

Apple is close to acquiring Shazam, sources say - https://news.ycombinator.com/item?id=15881896 - Dec 2017 (292 comments)

Show HN: Shazam-like acoustic fingerprinting of continuous audio streams - https://news.ycombinator.com/item?id=15809291 - Nov 2017 (76 comments)

How Shazam Works (2015) - https://news.ycombinator.com/item?id=15350729 - Sept 2017 (13 comments)

Tell HN: Shazam picks up song from my kitchen light - https://news.ycombinator.com/item?id=11593305 - April 2016 (2 comments)

How Shazam works - https://news.ycombinator.com/item?id=9870408 - July 2015 (48 comments)

Patent infringement claim re: “Creating Shazam in Java” blogpost (2010) - https://news.ycombinator.com/item?id=9594480 - May 2015 (18 comments)

The Shazam Effect (2014) - https://news.ycombinator.com/item?id=9593429 - May 2015 (37 comments)

The Shazam Effect - https://news.ycombinator.com/item?id=8634357 - Nov 2014 (34 comments)

Ask HN: Is there an audio search technology that finds exact and similar audio? - https://news.ycombinator.com/item?id=8420141 - Oct 2014 (3 comments)

Source code example of the Shazam algorithm - https://news.ycombinator.com/item?id=5724442 - May 2013 (16 comments)

Creating Shazam in Java - https://news.ycombinator.com/item?id=5723863 - May 2013 (43 comments)

An Industrial-Strength Audio Search Algorithm (Shazam) - https://news.ycombinator.com/item?id=2621103 - June 2011 (4 comments)

Shazam's Search for Songs Creates New Music Jobs - https://news.ycombinator.com/item?id=2215295 - Feb 2011 (1 comment)

How does the music-identifying app Shazam work its magic? - https://news.ycombinator.com/item?id=2214992 - Feb 2011 (2 comments)

How Shazam Works To Identify (Nearly) Every Song You Throw At It - https://news.ycombinator.com/item?id=1727891 - Sept 2010 (1 comment)

Implementing Shazam with Java in a weekend - https://news.ycombinator.com/item?id=1702975 - Sept 2010 (23 comments)

Shazam: not magic after all - https://news.ycombinator.com/item?id=909263 - Oct 2009 (28 comments)

How does the music-identifying app Shazam work its magic? - https://news.ycombinator.com/item?id=893353 - Oct 2009 (16 comments)

Shazam Has 50 Million Users and Secures Investment From KPCB - https://news.ycombinator.com/item?id=882537 - Oct 2009 (13 comments)


Dang is sorta like Shazam for HN articles.


I wonder how he works...?


HN search (in the page footer) uses the great service at https://hn.algolia.com, which will return those results, and more, indexing post titles, comments, and other categories for search.

I imagine he has a script for this use case, though.

* "related. comments" https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...

* "related. comments dang" https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...

* "Related. comments dang news.ycombinator.com" https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...
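If you would rather script it than click through the UI, a small sketch like the one below builds a query URL for the public HN Search API that Algolia hosts. I'm assuming the `search_by_date` endpoint and the `comment` / `author_<name>` tag filters behave as documented; the helper name and defaults are mine, and this is certainly not whatever dang actually uses. It only builds the URL and does not make a network request.

```python
from urllib.parse import urlencode

# Assumed endpoint of the HN Search API (results sorted newest first).
API = "https://hn.algolia.com/api/v1/search_by_date"


def related_comments_url(author: str, query: str = "Related", page: int = 0) -> str:
    """Build a search URL for comments by `author` that match `query`."""
    params = {
        "query": query,
        "tags": f"comment,author_{author}",  # comma means AND in the tags filter
        "page": page,
    }
    return f"{API}?{urlencode(params)}"


if __name__ == "__main__":
    print(related_comments_url("dang"))
```

Fetching the URL with any HTTP client should return JSON, which you can then filter down to the "Related" list comments.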


Wow! Had no idea Shazam existed back in 2003.


contemporary with TunePrint https://www.mattababy.org/~belmonte/Tuneprint/ and at least one other. (Somehow TunePrint managed to get a record label to fill a fileserver with audio files and ship it back across country to use as training data - one of the stranger applications ever for AFS :-)


I was hoping it had something to do with poles and zeros, more analog-y Laplace stuff. Never expected hashes.


I wonder if it can be expanded to things like speeches, snippets of movies/TV shows, etc.


yes.



Haven't come across anything similar for movies


I worked at a startup way back when which was using (I believe) a competing algorithm to match both images and audio. I really wish it had been Shazam... I might be rolling in it now...

That was... 2001ish?

(warning: too much alcohol in the system. Read with skepticism!)

I miss the dot com boom.



