The Resonant Vocal Tract
The human vocal tract — from the glottis to the lips — acts as an acoustic resonator roughly 17 cm long. Like any tube open at one end, it amplifies certain frequencies while attenuating others. These amplified frequency bands are called formants, numbered F1, F2, F3, and so on. By reshaping the tract (moving the tongue, jaw, and lips), speakers continuously alter these resonances to produce different vowels.
F1 and F2: The Vowel Code
Decades of phonetics research have shown that the first two formants carry almost all the information needed to identify a vowel. F1 tracks tongue height: close vowels like /i/ and /u/ have low F1 (~300 Hz), while open vowels like /a/ have high F1 (~800 Hz). F2 tracks tongue frontness: front vowels like /i/ have high F2 (~2200 Hz), while back vowels like /u/ have low F2 (~800 Hz). This creates the classic vowel space triangle.
Formant Bandwidth and Quality
Each formant has a bandwidth — the range of frequencies it amplifies. Narrow bandwidths produce sharp, ringing formants with clear vowel quality. Wider bandwidths, caused by increased damping from nasal coupling or breathy voice, produce more diffuse resonances. In pathological speech, abnormal bandwidths can indicate vocal tract dysfunction, making formant analysis a clinical diagnostic tool.
Applications in Technology
Formant analysis underpins modern speech technology. Automatic speech recognition systems extract formant features to identify phonemes. Voice synthesis engines manipulate formant tracks to generate natural-sounding speech. Forensic phonetics uses formant measurements for speaker identification. Even singing pedagogy uses formant tuning — opera singers learn to align formants with harmonics to project their voice over an orchestra.