Articulatory Speech Synthesis Simulator: Shape the Vocal Tract

Simulator · Intermediate · ~10 min

Default settings approximate a neutral vocal tract producing schwa /ə/ — the most common vowel in English. All articulators are near their rest position, producing mid-range formant values.

Formula

F1 ≈ k₁ × (jaw_opening + (1 - tongue_height))
F2 ≈ k₂ × (tongue_frontness) - k₃ × lip_rounding
F_n = (2n-1) × c / (4L_eff) for uniform tube approximation
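The uniform-tube formula above can be checked directly. The sketch below uses illustrative constants (c = 35,000 cm/s for the speed of sound in warm, moist air; L_eff = 17 cm for an adult male tract, as mentioned later in the text):

```python
def tube_formants(length_cm: float, n_formants: int = 3, c: float = 35000.0):
    """Quarter-wavelength resonances F_n = (2n-1)*c/(4*L) of a uniform tube
    closed at the glottis and open at the lips, in Hz."""
    return [(2 * n - 1) * c / (4 * length_cm) for n in range(1, n_formants + 1)]

print(tube_formants(17.0))  # ≈ [514.7, 1544.1, 2573.5] Hz
```

Note the odd-harmonic spacing: F2 and F3 of the uniform tube sit at exactly 3x and 5x F1, which is close to the measured formants of schwa.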

From Articulation to Sound

Every vowel you speak is the product of a specific vocal tract shape. By raising or lowering the tongue, pushing it forward or back, opening or closing the jaw, and rounding or spreading the lips, you reshape a tube roughly 17 cm long into an acoustic filter that selectively amplifies certain frequencies. This simulation lets you control these articulatory parameters and see how they map to formant frequencies and vowel identity in real time.

The Source-Filter Model

Speech production follows the source-filter model proposed by Gunnar Fant in 1960. The source is the quasi-periodic buzzing of the vocal folds (glottal source), producing a harmonic series. The filter is the vocal tract, which amplifies frequencies near its resonances (formants) and attenuates others. By separating source and filter, we can independently control pitch (source) and vowel quality (filter) — just as the human system does.
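The separation of source and filter can be sketched in a few lines: an impulse train supplies the pitch, and a cascade of two-pole resonators supplies the formants. This is a minimal, Klatt-style toy, not the simulation's actual signal path; all frequencies and bandwidths below are illustrative:

```python
import math

def impulse_train(f0: float, fs: float, n: int):
    """Crude glottal source: a unit impulse every fs/f0 samples."""
    period = int(round(fs / f0))
    return [1.0 if i % period == 0 else 0.0 for i in range(n)]

def resonator(x, f_c: float, bw: float, fs: float):
    """One formant: two-pole resonator y[n] = b*x[n] + a1*y[n-1] + a2*y[n-2]."""
    r = math.exp(-math.pi * bw / fs)
    a1 = 2.0 * r * math.cos(2.0 * math.pi * f_c / fs)
    a2 = -r * r
    b = 1.0 - a1 - a2
    y, y1, y2 = [], 0.0, 0.0
    for s in x:
        out = b * s + a1 * y1 + a2 * y2
        y.append(out)
        y1, y2 = out, y1
    return y

fs = 16000.0
source = impulse_train(120.0, fs, 1600)  # the source fixes pitch (120 Hz here)
voiced = resonator(resonator(source, 500.0, 80.0, fs), 1500.0, 90.0, fs)  # the filter fixes vowel quality
```

Changing `f0` alters pitch without touching vowel identity; changing the resonator frequencies alters the vowel without touching pitch, which is exactly the independence the source-filter model provides.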

Articulatory-Acoustic Mappings

The relationship between articulation and acoustics is nonlinear and many-to-one: different tract shapes can sometimes produce similar formant patterns (motor equivalence). However, the primary mappings are well established. Jaw opening and tongue lowering raise F1. Tongue fronting raises F2. Lip rounding lowers F2 and F3. The simulation computes these mappings using a simplified tube model, letting you discover the acoustic consequences of each articulatory gesture.
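These mappings can be written out using the formulas from the Formula section. The gains below are invented for illustration and merely chosen so that the neutral setting (every parameter at 0.5, no rounding) lands near schwa's typical F1 ≈ 500 Hz and F2 ≈ 1500 Hz:

```python
# Assumed gains (k1, k2, k3 in the Formula section); not the simulation's values.
K1, K2, K3 = 500.0, 3000.0, 600.0

def formants(jaw: float, height: float, frontness: float, rounding: float):
    """Map normalized (0..1) articulatory parameters to toy (F1, F2) in Hz."""
    f1 = K1 * (jaw + (1.0 - height))          # jaw opening and tongue lowering raise F1
    f2 = K2 * frontness - K3 * rounding       # fronting raises F2, rounding lowers it
    return f1, f2

schwa  = formants(0.5, 0.5, 0.5, 0.0)   # (500.0, 1500.0)
i_like = formants(0.2, 0.9, 0.9, 0.0)   # high front: low F1, high F2
u_like = formants(0.2, 0.9, 0.25, 1.0)  # high back rounded: low F1, low F2
```

Only the orderings matter in this sketch: the /i/-like configuration has a lower F1 and higher F2 than schwa, and rounding pulls the /u/-like configuration's F2 well below both.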

Building a Vocal Tract

Articulatory synthesis aims to generate speech by modeling the vocal tract as a series of concatenated tubes with varying cross-sectional areas. Advanced models simulate airflow, tissue compliance, and radiation from the lips. While modern text-to-speech systems primarily use neural networks, articulatory models remain essential for understanding speech motor control, simulating disorders, and teaching phonetics — because they reveal the causal chain from gesture to sound.
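The first step of such a concatenated-tube (Kelly-Lochbaum-style) model is simple enough to show: at each junction between adjacent tube sections, the reflection coefficient follows from the two cross-sectional areas. The area values below are made up for illustration:

```python
def reflection_coefficients(areas):
    """r_k = (A_{k+1} - A_k) / (A_{k+1} + A_k) at each junction
    between adjacent tube sections."""
    return [(a2 - a1) / (a2 + a1) for a1, a2 in zip(areas, areas[1:])]

uniform = [3.0] * 8  # cm^2; a uniform tract reflects nothing internally
print(reflection_coefficients(uniform))  # all zeros

constricted = [3.0, 3.0, 0.5, 0.5, 3.0, 3.0]  # a mid-tract constriction
print(reflection_coefficients(constricted))
```

A uniform area function yields zero internal reflections, which is why it behaves as the single quarter-wavelength resonator of the Formula section; a constriction introduces reflections that shift the formants away from that neutral pattern.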

FAQ

What is articulatory synthesis?

Articulatory synthesis generates speech by simulating the physics of the vocal tract. Instead of concatenating recorded speech segments, it models the tube-like resonances created by tongue, jaw, and lip positions. This produces highly flexible and natural output but requires accurate acoustic-articulatory models.

How does tongue position affect formants?

Tongue height primarily controls F1 — high tongue = low F1, low tongue = high F1. Tongue frontness primarily controls F2 — front tongue = high F2, back tongue = low F2. These relationships, established by Fant (1960), are the foundation of acoustic phonetics.

What role does lip rounding play?

Lip rounding extends the effective length of the vocal tract by several centimeters, lowering all formant frequencies. It has the strongest effect on F2 and F3. This is why rounded vowels (/u, o, y/) have lower F2 than their unrounded counterparts (/ɯ, ɤ, i/).
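The length effect follows directly from the quarter-wavelength formula: extending the effective tract lowers every resonance by the same ratio. A toy check, again with illustrative constants (c = 35,000 cm/s, a 2 cm extension from rounding):

```python
c = 35000.0  # speed of sound in cm/s (illustrative)

def fn(length_cm: float, n: int) -> float:
    """n-th quarter-wavelength resonance of a uniform tube, in Hz."""
    return (2 * n - 1) * c / (4 * length_cm)

# Ratio of unrounded (17 cm) to rounded (19 cm) formants: 19/17 ≈ 1.118 for each.
ratios = [fn(17.0, n) / fn(19.0, n) for n in (1, 2, 3)]
```

In this idealized uniform tube every formant drops by the same factor; in a real tract the protrusion and constriction at the lips affect F2 and F3 disproportionately, as the answer above notes.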

Can articulatory synthesis sound natural?

Modern articulatory synthesizers like VocalTractLab and DIVA produce intelligible and increasingly natural speech. The challenge is controlling the many degrees of freedom smoothly during connected speech. Articulatory synthesis is particularly valuable for research on speech disorders and second language learning.

Embed

<iframe src="https://homo-deus.com/lab/speech-science/speech-synthesis/embed" width="100%" height="400" frameborder="0"></iframe>
View source on GitHub