What is Pitch Correction: an easy explanation

Some people think that anyone who can produce reasonable vocal sound can be made to sound like a good singer with modern technology. That is not really true, though a lot can be done nowadays to improve for instance a recorded vocal by performing pitch correction. When a vocal is good in terms of sound and feeling and there are only a few off-key notes, it may be worth while to perform pitch correction, if only to save time and money by not having to do the recording again, even if you could.

Here we try to explain wat pitch correction does in not too techical terms, though a few technical words are inevitable.

First of all, pitch is not equivalent to frequency, but closely related to it; it is a perceptual quality that is primarily determined by the fundamental frequency of a natural tone. Any tone produced by a voice or instrument contains harmonically related frequencies which are a multiple of the fundamental frequency. For instance, an ''A'' at 220 Hz (i.e. 220 vibrations per second) contains also frequencies of 440 Hz, 660, 880 Hz etc; this is just how vocal chords, strings etc vibrate naturally. In this example 220 Hz is the fundamental frequency and this we perceive as the pitch. However, the perceived pitch is also influenced a little by duration of a note and loudness; a very loud note tends to sound a bit higher in pitch. Then there is also some noise (e.g. breath noise); the noise does not have a pitch.

When a singer sings ''A" off-key, it might for instance consist of 215 Hz, 430, 645, 860 Hz etc. To restore the correct pitch, we cannot just shift all frequencies up by 5 Hz, because we woud get a wrong harmonic sequence, which sounds extremely bad, and with no clear pitch at all. So how does it work?

The simplest way is to speed up the sound, by playing it faster than it is recorded. This will make it sound a bit faster, but also higher in pitch. A big disadvantage is that a sound that is played back faster, also becomes shorter in time. In order to compensate for the change in duration, we can time stretch the sound, by cutting it into a lot of very short parts (like a few hundred per second) and then adding some of those same additional parts to make the sound longer again (or shorter by overlapping the parts). This way we can change pitch and preserve the duration.

A second disadvantage of changing the speed is that the pitch is changed, but also the timbre or colour of the sound. An adult voice will start to sound like a child's voice if pitched up this way. Why is this so? It has to do with so-called formants. A vocal sound is produced by a singer's vocal chords, but is colored also by the throat, mouth etc. These colorations are the formants. In technical terms, this is called the source-filter model. Now when a singer raises his/her pitch, the vocal chords (source) produce higher frequencies, but the ''coloration'' (filter) remains approximately the same, as the throat and mouth do not change. In contrast, when we just speed up a sound, both the pitch and the coloration are changed and so it sounds like a voice of someone with a smaller throat and mouth, like a child.

So, a second improvement is trying to preserve the formants. This can be done by first measuring the coloration (formants), and taking it out by applying an inverse coloration. Then, the ''uncolored'' neutral pitched sound is pitch corrected, and the original coloration is applied again to this. Now we have a good approximation of what the singer would have produced by singing a higher pitch.

There are a lot of technical diffulties and details to be taken into account in applying the above principles, but all modern pitch correction techniques have incorpated such methods, one way or the other. Pitch correction is mostly used in studios on vocal tracks, and sometimes even live on stage.

Until now, correction of vocal pitch of a voice inside an already mixed recording was not possible, because a pitch correction would pitch shift the whole music. CSP audio has a developed a new technique that can pitch correct a voice inside a mix, see www.pitchcorrection.com.

For those who are technically inclined here are some technical links about pitch correction:

Phase vocoder principles

More phase ocoder techniques

PSOLA