Pick a frequency. Let's say you picked 400 hertz. If you multiply this frequency by a simple fraction (1/2, 1/3, 2/5, etc) you get a new frequency that harmonizes with your original frequency. That means they sound nice when played together. The simpler the fraction, the better the two frequencies will sound (1/2 sounds nicer than 7/13, for example).
If you pick several of these fractions between 1 and 2 (such as 3/2, 4/3, 5/4, 2/1, etc) you create what's called a scale. All the frequencies in the scale harmonize with the starting frequency, but they don't necessarily harmonize with each other.
Musicians don't always want to play with the same starting frequency so they invented "equal temperament". The idea behind equal temperament is to create a scale using logarithms/exponents instead of fractions. Because it's logarithmic, any frequency in the scale can be used as a starting point and you'll get the same result.
Pure sine waves don't harmonize. If you play a sine wave at 400Hz, and continuously vary the frequency of another sine wave from say 700 to 900Hz, you won't hear any special consonance at 800. It will sound just as ugly as the neighbors.
What really harmonizes is the overtone series. The human voice, and instruments imitating it, have overtones at integer multiples of the main frequency. For example, if I sing a note at 400Hz, it will consist of a sum of sine waves at 400, 800, 1200 etc. When two such notes are sounding at the same time, and their overtone series partially match up - that's when you hear harmony. It's easy to see that it happens at small integer ratios.
The guy who came up with this idea (Sethares) also came up with an easy way to test it. He synthesized bell-like sounds whose overtone series aren't exactly integers. And sure enough, melodies with integer ratios of pitches sound horrible when played on that instrument, but melodies with tweaked ratios sound perfectly fine.
EDIT: Thank you HN! I believed this for years, but after writing this comment and getting some replies I went and checked, and it's not completely true. Matching overtones play a role, but simple frequency ratios sometimes work even without overtones, and there are proposed explanations for that. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2607353/#idm139...
If I open 2 tabs of https://www.szynalski.com/tone-generator/ and then listen to 440+880, and then change 880 to 850 it is a world of difference. I would definitely describe that difference as dissonance and consonance.
Now the overtone series IS important and is not always 'simple ratios', a good example in a real instrument is the strong minor third overtone of a carillon, and as expected writing in major for that instrument is hard.
Thank you for that link. I never thought of my browser as a test bench before. (Of course, now I want a DVM, function generator, scope, logic analyser, spectrum analyzer and all the other goodies ;) ).
I'm not sure if this is the same thing as consonance/dissonance, but the graph of sin(x) + sin(2x), an octave, is regular and pretty and the graph of sin(x) + sin(sqrt(2)x), a tritone, is much less so.
Except that if you use a frequency ratio of sin(x)+sin(2.01 x), which is really very close to an octave and really sounds just as consonant as an octave to almost all people, you almost the same "dissonant" picture:
The strange thing is, none of these "simple ratio" theories account for the fact that our brains allow a lot of "fuzziness" around these simple ratios, so much that you can't really call them simple ratios as they encompass a whole bunch of not-simple ratios as well.
That sine wave sounds a bit "fuzzy" to me, maybe the generator adds a small amount of overtones or aliasing. I tried another generator (http://onlinetonegenerator.com/) and the consonance feels weaker.
Interesting, I still hear it the same, dissonant and consonant, perhaps western music ruined me. Thanks for sharing.
Edit: Didn't see the url, makes my old reply obsolete:
Interesting. I tried to avoid clipping/aliasing by using audacity with as high quality audio as my system allows and I can still reproduce pretty much exactly what you hear on those websites. https://vocaroo.com/i/s0Be5CexLgVs is 440hz, then 440hz+880hz, then 440hz+850hz. But I would be interested in any repeatable signal that does not harmonize at all so do share!
This is why tuning pianos is so hard, btw. The overtones are way more important than anything else about each set of strings.
If you don't take care of the overtones, playing scales will create a sort of "wah" effect that was cool in the 60's, but not so desirable for the freshly tuned piano. It's one of many reasons straight up MIDI sounds so weird. (Instrument modelling and multiple samples fixes that).
And you have different temperments, which flavour the sounds in different ways even after the "clashing" overtones are taken care of.
It's all related to how phonemes, units of speech, make different vowels or consonants when the pitch is changed. You'd be surprised at how much a speech sound changes in perception just because of the pitch. It has everything to do with those "What do you hear?" memes out there. Our brains do interesting things to similar wave envelopes at different pitches.
Fascinating stuff if you're into that sort of thing.
>It's one of many reasons straight up MIDI sounds so weird
PSA for anyone who needs to hear it: MIDI doesn't have a sound any more than sheet music does.
General MIDI-compatible software tone generator in Windows 95 is no more "MIDI" than an untuned piano in an abandoned house is "classical music".
MIDI to music is what TCP/IP is for communication (incidentally, these protocols are of the same age). If you want to "hear" MIDI, turn on the radio. You will "hear" MIDI in the same way you are "seeing" TCP/IP now, reading this page.
The limitations of this protocol do have an effect on sound, but in a subtle way. For instance, implementation of polyphonic pitch bend / slide was not standardized in the 80s. As a result, it was pretty much absent from electronic instruments until recently. A new MIDI-based standard, MPE, addresses that.
Surprisingly, what makes tones sound consonant is a matter of controversy in music perception research. The ratio between fundamental frequencies theory is most easily debunked: if that were true, equal temperament tuning would sound unbearable, but the over-tone matching theory is not confirmed either thus far.
In fact, here are two recent studies that suggest life-time exposure plays a significant role in the perception of consonance. If that's true your own judgement of consonance of two sounds is not a good evaluation of a theory of consonance, because that judgement may be shaped by your cultural exposure.
> The ratio between fundamental frequencies theory is most easily debunked: if that were true, equal temperament tuning would sound unbearable, but the over-tone matching theory is not confirmed either thus far.
Strong disagree there. EDO results in pure 4ths and 5ths. And I’m not sure how you can separate ratio of fundamentals from overtone consonance. And I’m not sure how you can separate “pleasing” from culture/upbringing, or why anyone would ever think that you could. It’s immediately evident by different people having different tastes. I appreciate the research, but I don’t think any of this is at all surprising.
That some properties of “consonance” are shared between cultures seems unsurprising, too, since the sounds everyone is exposed to will follow the same underlying acoustic properties. You can’t form words without listening to the overtones above your fundamental frequency, and forming resonance creates a notable body experiences, so those ratios are going to play an important role in most cultures as a matter of course.
Maryanne Amacher made some music that relies on the non-linearity of ears, what is known as "otoacoustic emissions." I guess if you put a sound source that produces the sum of two pure sine waves up to a human ear then record what comes back out, it can add in combination tones.
I think it's easiest to perceive in Chorale 1: https://www.youtube.com/watch?v=rtmv6LxNJqs&t=3079s (I'm not sure if I only hear the low-pitched tones because of non-linearities of my amplifier and headphones at higher volumes though...)
It's way too easy to get overtones from a sine wave. Lots of music production involves compression of dynamic range into a smaller interval, but this introduces integer overtones. For a toy example, we can think of arctan as being a compression function that takes infinite dynamic range into the interval [-1,1]. Playing around with some Fourier series, it looks like arctan(sin(x)) has a bunch of extra odd harmonics over the sin(x) fundamental.
In May, there was a HN post about a statistical mechanical model that derived a scale from an overtone model. It would be cool to see what comes out of inharmonicity (like bells).
https://advances.sciencemag.org/content/5/5/eaav8490.full
But, if I play pure sin(t440) + sin(t440x2^(4/12)) + sin(t440x2^(7/12)), it still sounds like a major chord?
I'm on mobile, so I can't whip up a jsfiddle, but I know from experience this doesn't sound terrible?
It's hard to type the equations out on my phone, but the resulting wave when adding a and a# has a very large period and sounds bad, where a + c# has a shorter period and sounds good. I'd be curious about the pure sine wave which matches the period of the summed waves.
I feel like there's more to this. Maybe I can make a visualization with matching sound over the weekend.
Your lay explanation doesn't account for some important fundamentals - like the subtle distinctions between the different historic tunings, the fact that pure equal temperament is only really a thing on electronic instruments, the need for stretch tuning on some acoustic instruments but not others, the origins of the melodic minor, the harmonic minor, and the modes, the use of the dissonant major second in many folk musics around the world... and that's not even getting into the complications of composing in different styles.
It's not technically wrong to say that equal temperament is based on the 12th root of 2. But it's also not a complete description of real world tunings, especially in acoustic performance.
Generally, math turns out to be a bad way to understand music. There are elements in music that look like math, but the similarities turn out to be superficial and massively oversimplified. If you take them too literally you run into serious conceptual problems almost immediately.
In fact the defining feature of music is that it always slips through any simple bounded conceptual model. Music simply isn't simple. That's what makes it so interesting.
Temperament is irrelevant to most music, even if you're talking about western music using the 12 tone scale. The exceptions are keyboards and electronic music.
The simple reason is that most instruments can't even be played consistently in tune, to the point where you could figure out if they're playing in any temperament at all. String instruments are tuned by pure intervals across the open strings, then the player has to figure out how the other notes should sound. They will push notes up and down to make them sound more "right" in the immediate context. Wind instruments are a bag of compromises.
A large percentage of musician jokes are about intonation.
Early keyboard instruments were tuned in simple temperaments that a musician could learn how to do quickly. A harpsichord had to be tuned before every performance. Equal temperament required an instrument that stayed in tune long enough to make it worth hiring an expert to tune it.
> String instruments are tuned by pure intervals across the open strings
Depending on what you mean by a 'pure interval,' it might come as a surprise that string instrumentalists tend to tune their fifths narrower than 3/2 (and apparently sometimes narrower than a 12-EDO perfect fifth!). This is so the perfect fifth above the highest string is tuned correctly as the major third (and some octaves) above the lowest string. Otherwise, the interval would be a Pythagorean major third (81/64) which is somewhat dissonant.
(In "How Equal Temperament Ruined Harmony," Duffin recounts how in the late 1800s, even the best piano tuners in Britain were unable to get exact equal temperament, being off by about 1 cent per note in such a way that favored common keys. And, even so, equal temperament for pianos was not popular until the 1910s -- non-equal temperaments were favored due to their sound rather than just their practicality.)
I'm not surprised at all. I was taught to use perfect intervals myself, and it's how I've always seen it taught. But tuning is a very interesting topic.
These days I play double bass in a jazz band, so of course every single instrument has its own tuning quirks.
An amusing anecdote: I played in a band, and the drummer complimented my intonation. I asked him how a drummer knows anything about intonation. He said: "My college major was trombone."
They will push notes up and down to make them sound more "right" in the immediate context.
Not all of the stringed players. One of the harder things on violin for me, after years of fretted instruments, is still intonation. Makes a mandolin seem like a push-button version of a violin. (We will save that infernal stick-and-horsehair thing for another discussion.)
The flip side is that when the mandolin gets out of tune (and oh, it does), you just have to live with it until the next opportunity to fire up the tuner.
... and why did we in the west end up with a 12-note scale? 12 equal divisions of the octave lets you approximate a lot of the interesting ratios 2/3, 3/4 etc (only approximate, see 'even tempered') without the notes being too close together. I think I read the next optimum division would be to divide the octave into 43, but then you're getting far too muddy.
... and if its 12 equal semitones, why have some of them got proper names (CDE) and some treated as variants (sharp/flat). Or in other words, why the white keys and black keys on the piano?
Well lets take the 'foundation' note of your piece of music - this would, more or less, be the note that gets involved the most, although its a bit more complicated than that. Building from that note, and lets say we've chosen C to make things simpler, if you wanted to choose other semitones from the 12 available that would let you build lots of nice ratios starting from C and the note thats 3/2 above it which would be G, then you more or less end up with the white keys on the piano, and you leave the black ones out (mostly). Thats called the C-major scale. There's also a minor version where you choose slightly different semitones to get a 'sadder' sound, and thats C-minor.
The names of the notes (CDEFGAB) are the white keys on a piano are based on the C-major scale, with the sharps and flats defined in relation to them. Some sort of legacy naming convention that is now baked into musical notation at the lowest level and makes everything much less clear than it should be. Because it you want to transpose up or down and use a different foundation note (as happens literally all the time) then you have to use a mix of notes with proper names CDE and the sharps and flats (a mix of white and black piano keys) and thats when it gets confusing. Well, unless you know what you're doing I suppose, and then its not confusing.
In summary: the major and minor scales are kindof optimum selections of notes from the 12 available to make your song sound fairly good. Like a kindof best practice.
But the names of the notes are ridiculous. Its as if instead of the digits 0123456789 we had some weird number system where 3 didnt exist and we called it '4 flat' and 6 was replaced by '5 major' for no good reason.
It’s worth noting that equal temperament predates the concept of a logarithm function, so the methods they used were both clever and imprecise.
Also worth nothing that mean temperament was an intermediate step between Pythagorean tuning and equal temperament, musicians didn’t jump directly from simple fraction into equal temperament.
But the really weird thing is that, any ratio that is close to a simple fraction like (say) 3:2, but not quite, is in fact a much less simple fraction, maybe 67:23 or something.
Yet we don't need the simple ratios be entirely exact, they can be off a little and you get used to it and it's fine. That's why equal temperament even works.
So our brains like simple ratios but also like ratios that are almost kinda like simple ratios but not quite. How does that work? Isn't any ratio close to a "simple ratio" when you leave some wiggle room like this?
The problem is, once you construct a diatonic scale using these "Pythagorean" ratios, trying to play a diatonic scale that starts and ends on a different pitch, but using the same set of pitches, will have intervals that aren't in tune and sound terrible.
Musicians and theorists struggled with this for years, coming up with various compromises that sounded good in some "keys" and awful in others, until math provided the ultimate compromise: base your logarithmic scale on twelfth roots of two and every key will be exactly the same more-or-less-in-tune.
> Musicians and theorists struggled with this for years, coming up with various compromises that sounded good in some "keys" and awful in others, until math provided the ultimate compromise: base your logarithmic scale on twelfth roots of two and every key will be exactly the same more-or-less-in-tune.
This is inaccurate in two ways.
First, only keyboards and those playing with them, use a fixed temperament. Everyone else, strings, voices, brass, winds, adjusts the pitch of individual notes based on vertical and horizontal context. A c does not have a fixed number of Hz throughout a piece.
Second, math did not provide the ultimate compromise. Equal temperament was known as far back as the fourth century BC, and people were advocating for and composing in equal temperament in the sixteenth century. Rejecting equal temperament in favor of meantone temperings was a conscious decision, not a compromise from ignorance.
In my musical training I was taught that musicians figured it out in the west by trial error. Bach then wrote the Well Tempered Klavier to demonstrate pieces in all 12 major and 12 minor keys, in 1722.
There is a reference here however to a chinese mathematician who worked it out in 1584. I don't know if his work made it across the continent and influenced anyone. It is not likely that many musicians would have understood the math in the 16th century.
Notably, well-temperament is not equal-temperament. Also under-appreciated is that you can change the pitch of each strike of a key on a clavichord by striking harder/softer or pressing into the key. It uses a “hammer-on” like action, so if you press into the key after striking a note, you will increase the tension on the string and raise the pitch. For me, realizing this gave me a whole new appreciation for CPE Bach’s music. There’s probably a dissertation in that somewhere for somebody.
A klavier is not necessarily a clavichord, but an organ has some interesting tuning tricks up its sleeves with its various stops.
Organs are not really tuned with the stops, the stops engage or disengage banks ('ranks' in organ terminology) of related pipes. The only way this would affect tuning is if you have a secondary rank with an alternate tuning, you'd have to disable one rank and enable another.
Organs are tuned directly at the individual pipes (just like string instruments are tuned at the strings). Each pipe, depending on the kind of pipe used has a slide that goes in and out on one end to match the pipe length or, alternatively, a tiny tongue of metal that is rolled up or extended. Tuning (or 'voicing') an organ is super labor intensive and time consuming.
I am well aware. I have participated in the tuning of a (small) organ, as I am an organist. You can affect the perceived tuning of the instrument by engaging mutation stops / aliquots. Further, speaking from experience, in practice, stops have a tendency to go out of tune at different rates. Most organists know their instrument very well, have have an intuition for what keys will sound best with what stops at any given time on their instrument.
This doesn't really explain any of the music theory that's described on the linked page, unless you're meaning you're explaining a theoretical basis for harmony itself? The kind of music theory on the page is about triadic harmony, scales, the ways in which one triad moves to the next, non-chord tones, melody, and rhythm.
I do like thinking about tuning systems, however.
- If you have purely harmonic instruments, like bowed strings or the human voice (due to mode locking), there's some justification to try messing around with extended just intonation systems (ones where the ratios use prime factors that go beyond 2, 3, and 5). Ben Johnston has a workable notation for this. But, even after having played around with this for a while, I still struggle to hear things involving 7 or 11 as being in tune!
- For slightly inharmonic instruments, like pianos or plucked strings, just intonation seems to make a bit less sense... though La Monte Young's "The Well-Tuned Piano" with it's 2,3,7-based tuning does work pretty well. (And as someone else pointed out, piano tuners compensate for inharmonicity even for 12-EDO tuning by making all the intervals just a bit wider. So twelfth roots of two don't completely explain things.)
- I'd be interested in seeing how highly inharmonic instruments, like bells, might be tuned to take advantage of their own unique harmonies. It might be that a "5/4" ratio means "take the fifth harmonic of this bell, then find a bell for which that harmonic is the 4th harmonic."
Equal temperament, by the way, apparently wasn't popular until around 1900-ish, and so-called equal temperaments before that tended to be various kinds of unequal but "circular" temperaments that worked well enough for every key, yet still preferred common keys. See [1].
12-tones is an approximation of tuning systems that had been long used by singers and string players. A C# is slightly lower in pitch than a Db (in cents, this is roughly 87 cents vs 109 cents; musicians should be able to perceive deviations of 5 cents, for comparison). In fact, there were early experiments in having split black keys to be able to have both pitches on a keyboard. In [1], the author argues that 1/6-comma meantone is a good approximation for this system, which can be explained as 55-EDO. This makes sweeter major thirds (closer to 5/4 than 12-EDO's 81/64) at the expense of more dissonant perfect fifths, which tends to work out OK for post-medieval western harmonic practice.
[1] Ross Duffin, "How Equal Temperament Ruined Harmony"
Pick a frequency. Let's say you picked 400 hertz. If you multiply this frequency by a simple fraction (1/2, 1/3, 2/5, etc) you get a new frequency that harmonizes with your original frequency. That means they sound nice when played together. The simpler the fraction, the better the two frequencies will sound (1/2 sounds nicer than 7/13, for example).
If you pick several of these fractions between 1 and 2 (such as 3/2, 4/3, 5/4, 2/1, etc) you create what's called a scale. All the frequencies in the scale harmonize with the starting frequency, but they don't necessarily harmonize with each other.
Musicians don't always want to play with the same starting frequency so they invented "equal temperament". The idea behind equal temperament is to create a scale using logarithms/exponents instead of fractions. Because it's logarithmic, any frequency in the scale can be used as a starting point and you'll get the same result.