Phonetic Vowel Space: Mapping the Sounds of Language

simulator intermediate ~9 min
Loading simulation...
5 vowels: /a e i o u/ — maximally dispersed

The world's most common vowel system has 5 vowels positioned at the extremes of the vowel space — maximizing acoustic distinctiveness. The F1-F2 formant plot reveals that tongue height and backness are the primary dimensions that distinguish vowels across all human languages.

Formula

Euclidean distance: d(v1, v2) = sqrt(Math.pow(F1_1 - F1_2, 2) + Math.pow(F2_1 - F2_2, 2))
Dispersion index: DI = (2 / (n*(n-1))) * sum(d(v_i, v_j)) for all pairs
Vowel space area (3 corner vowels): A = 0.5 * |F1_i*(F2_a - F2_u) + F1_a*(F2_u - F2_i) + F1_u*(F2_i - F2_a)|

The Geometry of Vowel Sounds

Every vowel you produce is shaped by the position of your tongue inside your mouth. Raise it high and forward, and you get /i/ (as in 'beet'). Lower it and pull it back, and you get /ɑ/ (as in 'father'). The acoustic fingerprint of each position is captured by two frequencies — F1 and F2 — which together create a map where every vowel has a unique coordinate. This is the vowel space.

Why Languages Choose Their Vowels

Of the infinite possible tongue positions, languages select only a handful of vowels — typically 5 to 7. But the choices are not random. Dispersion Theory, proposed by Lindblom in 1972, explains that languages space their vowels to maximize distinctiveness: the further apart two vowels are in acoustic space, the easier they are to tell apart. The universal 5-vowel system /a, e, i, o, u/ represents the most efficient packing of maximum contrast into minimum inventory.

Formants: The Physics of Vowels

When you speak, your vocal cords produce a buzz of many frequencies. Your tongue, lips, and jaw shape the vocal tract into a resonating tube that amplifies certain frequencies (formants) and dampens others. F1, the lowest formant, drops as the tongue rises. F2, the second formant, rises as the tongue moves forward. Together they explain over 90% of vowel perception — which is why a simple 2D plot can represent the entire vowel space.

Cross-Linguistic Patterns

The most striking finding in phonetic typology is the regularity of vowel systems. Three-vowel languages almost always choose /a, i, u/ — the three corners of the vowel space. Five-vowel languages add /e/ and /o/. Seven-vowel languages fill in further. Each addition maximizes the distance from existing vowels, as if languages are solving an optimization problem. This simulation lets you see that optimization in action.

FAQ

What is the vowel space in phonetics?

The vowel space is a two-dimensional acoustic map where vowels are plotted by their first formant (F1, correlating with tongue height) and second formant (F2, correlating with tongue backness). This creates the familiar trapezoidal IPA vowel chart, which represents the full range of vowel sounds producible by the human vocal tract.

Why do most languages have 5 vowels?

The 5-vowel system /a, e, i, o, u/ maximizes the perceptual distance between vowels given the constraints of the vocal tract. This is called Dispersion Theory: languages tend to spread their vowels as far apart as possible in the acoustic space to minimize confusion, and 5 vowels achieve an efficient balance.

What are formant frequencies?

Formants are resonant frequencies of the vocal tract. F1 (first formant, 200–900 Hz) varies inversely with tongue height — low vowels like /a/ have high F1. F2 (second formant, 700–2500 Hz) correlates with tongue backness — front vowels like /i/ have high F2. Together they uniquely identify each vowel.

How does the IPA vowel chart work?

The IPA (International Phonetic Alphabet) vowel chart arranges vowels in a trapezoid with tongue height on the vertical axis (close/high to open/low) and backness on the horizontal axis (front to back). Rounding is shown by paired symbols. It's a schematic map of the articulatory-acoustic vowel space.

Sources

Embed

<iframe src="https://homo-deus.com/lab/linguistics/phonetic-vowel-space/embed" width="100%" height="400" frameborder="0"></iframe>
View source on GitHub