Painting Sound in Time and Frequency
The spectrogram, invented at Bell Labs in the 1940s, revolutionized speech science by making the invisible patterns of sound visible. Before spectrograms, phoneticians relied on ear training and crude oscilloscope traces. The spectrograph revealed for the first time that vowels have characteristic formant patterns, that consonants leave distinct acoustic signatures, and that speech is far more complex and variable than anyone had imagined.
The Window Tradeoff
Every spectrogram faces a fundamental tradeoff rooted in the uncertainty principle: you cannot pin down both the exact time and the exact frequency of a sound event simultaneously. A short analysis window (5-10 ms) produces a wideband spectrogram that captures rapid events like stop bursts and individual glottal pulses with precision, but smears frequency detail. A long window (40-50 ms) produces a narrowband spectrogram in which individual harmonics appear as crisp horizontal lines, but temporal events blur together. Most speech analysis settles on 20-30 ms as a practical compromise.
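To make the tradeoff concrete, here is a minimal sketch using SciPy on a synthetic 120 Hz impulse train (a crude stand-in for glottal pulses); the sample rate, pulse rate, and window lengths are illustrative assumptions, not canonical settings. At 16 kHz, the 5 ms window gives 200 Hz frequency bins, too coarse to separate harmonics spaced 120 Hz apart, while the 50 ms window gives 20 Hz bins at the cost of far fewer time frames.

```python
import numpy as np
from scipy.signal import spectrogram

fs = 16000                          # assumed sample rate (Hz)
f0 = 120                            # pulse rate, roughly a low voice's f0
signal = np.zeros(fs)               # one second of silence...
signal[::fs // f0] = 1.0            # ...punctuated by a ~120 Hz impulse train

for win_ms in (5, 25, 50):          # short, compromise, and long windows
    nperseg = int(fs * win_ms / 1000)
    freqs, frames, Sxx = spectrogram(signal, fs=fs, nperseg=nperseg,
                                     noverlap=nperseg // 2)
    # Frequency resolution is fs / nperseg; time resolution ~ the window length.
    print(f"{win_ms:>2} ms window: {fs / nperseg:5.1f} Hz bins, "
          f"{len(frames)} time frames")
```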
Reading the Patterns
Trained phoneticians can read spectrograms almost like text. Vowels appear as stable formant bands: dark horizontal bars whose frequencies identify the vowel. Stop consonants show gaps (closures) followed by brief bursts. Fricatives appear as high-frequency noise bands. Nasals show low-frequency energy with antiformants (spectral zeros that appear as light bands). Formant transitions between consonants and vowels reveal the place of articulation, a key cue the brain uses for speech perception.
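Formant reading can also be automated. The sketch below shows one classical approach, linear predictive coding (LPC) root-finding, assuming librosa is available and that `frame` holds a single voiced segment of roughly 30 ms; the function name, pre-emphasis coefficient, model-order rule of thumb, and 90 Hz cutoff are illustrative choices rather than a definitive recipe.

```python
import numpy as np
import librosa

def estimate_formants(frame, fs=16000, order=None):
    """Return rough formant frequencies (Hz) for one voiced frame."""
    if order is None:
        order = 2 + fs // 1000                  # a common rule of thumb
    frame = np.asarray(frame, dtype=float)
    # Pre-emphasis boosts high frequencies, then a Hamming window tapers edges.
    frame = np.append(frame[0], frame[1:] - 0.97 * frame[:-1])
    frame = frame * np.hamming(len(frame))
    a = librosa.lpc(frame, order=order)         # LPC polynomial coefficients
    # Each conjugate root pair is a resonance; keep the upper-half-plane roots
    # and convert their angles to frequencies in Hz.
    roots = [r for r in np.roots(a) if np.imag(r) > 0]
    freqs = sorted(np.angle(roots) * fs / (2 * np.pi))
    # Drop near-DC artifacts and keep the lowest few resonances as formants.
    return [f for f in freqs if f > 90][:4]
```

Applied to a steady vowel, the first two returned frequencies should land near the F1 and F2 values that define the dark bars on the spectrogram.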
From Spectrograms to Speech Recognition
Modern automatic speech recognition (ASR) systems are essentially sophisticated spectrogram readers. The mel-frequency cepstral coefficients (MFCCs) used in classical ASR are derived from a spectrogram-like representation. Even end-to-end neural ASR systems, like those in voice assistants, typically take log-mel spectrogram features as input. Understanding spectrograms remains essential for debugging and improving speech technology.
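As a hedged illustration of that lineage, the following sketch walks through the classical MFCC pipeline with librosa, starting from an ordinary power spectrogram; the filename, frame sizes, and filterbank parameters are assumptions chosen to match common ASR defaults (25 ms windows, 10 ms hop, 40 mel channels, 13 coefficients).

```python
import numpy as np
import librosa
import scipy.fftpack

# "utterance.wav" is a hypothetical input file.
y, sr = librosa.load("utterance.wav", sr=16000)

# 1. Power spectrogram: 25 ms windows (400 samples), 10 ms hop (160 samples).
S = np.abs(librosa.stft(y, n_fft=400, hop_length=160)) ** 2
# 2. Mel filterbank warps the frequency axis toward human pitch perception.
mel = librosa.feature.melspectrogram(S=S, sr=sr, n_mels=40)
# 3. Log compression, then a DCT decorrelates the channels into cepstra.
mfcc = scipy.fftpack.dct(librosa.power_to_db(mel), axis=0, norm="ortho")[:13]
print(mfcc.shape)  # (13, n_frames)
```

Every step downstream of the STFT is a fixed, invertible-in-spirit transformation of the spectrogram, which is why spectrogram literacy transfers directly to debugging ASR front ends.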