Pitch Tracking Simulator: Visualize Speech Intonation Contours

Falling contour — declarative intonation

A base frequency of 120 Hz with a 4-semitone range produces a natural-sounding male declarative contour. The pitch peaks on the stressed syllable and falls to signal sentence finality.

Formula

Semitones = 12 × log₂(f₁ / f₂)
f₀ = 1 / T₀ (fundamental period to frequency)
Jitter (%) = 100 × mean|T_i - T_{i+1}| / mean(T_i)
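The three formulas above can be checked with a short worked example. This is an illustrative sketch, not the simulator's code; the sample period values are made up for the demonstration.

```python
import math

def semitones(f1, f2):
    """Interval in semitones between two frequencies: 12 * log2(f1 / f2)."""
    return 12 * math.log2(f1 / f2)

def jitter_percent(periods):
    """Local jitter: mean absolute difference between consecutive
    fundamental periods, as a percentage of the mean period."""
    diffs = [abs(a - b) for a, b in zip(periods, periods[1:])]
    return 100 * (sum(diffs) / len(diffs)) / (sum(periods) / len(periods))

# A 4-semitone rise above a 120 Hz base lands near 151 Hz:
peak = 120 * 2 ** (4 / 12)
print(round(peak, 1))                      # ≈ 151.2
print(round(semitones(peak, 120.0), 2))    # 4.0

# f0 = 1 / T0: a fundamental period of ~8.33 ms gives ~120 Hz
print(round(1 / 0.00833, 1))               # ≈ 120.0

# Jitter on three hypothetical cycle periods (in seconds)
print(round(jitter_percent([0.0083, 0.0084, 0.0083]), 2))
```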

The Melody of Speech

Every utterance carries a melody — the pitch contour created by variations in vocal fold vibration rate. This fundamental frequency (f₀) contour is one of the richest channels of linguistic information, encoding whether you are asking a question or making a statement, which word carries emphasis, and even your emotional state. Pitch tracking — extracting this f₀ contour from the acoustic signal — is one of the most important tasks in speech analysis.

Intonation Contours

Languages use characteristic pitch patterns called intonation contours. In English, declarative statements typically have a falling contour: pitch peaks on the nuclear (most stressed) syllable and falls to the baseline. Yes/no questions end with a rise. Wh-questions often fall. These patterns are language-specific — in Bengali, statements rise, and in Sicilian Italian, questions fall. This simulation lets you compare these contour types.
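The falling and rising patterns described above can be sketched as simple parametric curves. This toy model (not the simulator's implementation) uses the 120 Hz base and 4-semitone range from the preset; the exact contour shapes are illustrative assumptions.

```python
def contour(base_hz, range_semitones, n=20, shape="fall"):
    """Toy intonation contour over n pitch points.

    'fall': peak at the start (nuclear syllable), falling to the baseline,
            as in an English declarative.
    'rise': low start with an accelerating rise, as in a yes/no question.
    """
    peak = base_hz * 2 ** (range_semitones / 12)
    points = []
    for i in range(n):
        t = i / (n - 1)  # normalized time, 0..1
        if shape == "fall":
            f = peak - (peak - base_hz) * t
        else:
            f = base_hz + (peak - base_hz) * t ** 2
        points.append(round(f, 1))
    return points

print(contour(120, 4, n=5, shape="fall"))
print(contour(120, 4, n=5, shape="rise"))
```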

Measuring Pitch Perturbation

No voice is perfectly periodic. Slight cycle-to-cycle variations in the vibration period (jitter) and amplitude (shimmer) give each voice its unique character. Normal voices have jitter below 1%. Professional singers often have remarkably low jitter, while pathological voices (vocal nodules, paralysis) show elevated jitter. Tracking these perturbations is essential for clinical voice assessment and for making synthetic speech sound natural.
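Shimmer is measured exactly like jitter, but on cycle amplitudes rather than periods. A minimal sketch, using randomly perturbed amplitudes as a stand-in for measured cycle peaks:

```python
import random

def shimmer_percent(amplitudes):
    """Local shimmer: mean absolute cycle-to-cycle amplitude difference,
    as a percentage of the mean amplitude."""
    diffs = [abs(a - b) for a, b in zip(amplitudes, amplitudes[1:])]
    return 100 * (sum(diffs) / len(diffs)) / (sum(amplitudes) / len(amplitudes))

random.seed(0)
# Simulated cycle amplitudes with small random perturbation around 1.0
amps = [1.0 + random.uniform(-0.02, 0.02) for _ in range(200)]
print(round(shimmer_percent(amps), 2))
```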

From Acoustics to Meaning

Pitch tracking enables a wide range of applications: clinical voice assessment detects pathology from f₀ perturbation patterns; tonal language processing requires accurate f₀ for word recognition in Mandarin or Thai; emotion recognition systems use pitch range and contour shape to classify affect; and music information retrieval uses pitch tracking to transcribe melodies. The algorithms must handle the challenges of creaky voice, voice breaks, and background noise.

FAQ

What is pitch in speech?

Pitch is the perceptual correlate of fundamental frequency (f₀) — the rate at which the vocal folds vibrate. Adult male f₀ typically ranges from 85-180 Hz, adult female from 165-255 Hz, and children from 250-400 Hz. Pitch variations encode intonation, stress, and tone.

How does pitch tracking work?

Pitch tracking algorithms estimate f₀ by detecting periodicity in the speech waveform. Methods include autocorrelation (finding the period of maximum self-similarity), cepstral analysis (finding the quefrency peak), and neural network approaches. Praat's algorithm uses autocorrelation with dynamic programming.
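The autocorrelation idea can be demonstrated in a few lines: slide the signal against itself and pick the lag with maximum self-similarity within a plausible period range. This is a naive sketch of the concept only; production trackers like Praat's add windowing, normalization, candidate ranking, and dynamic programming across frames.

```python
import math

def estimate_f0(signal, sr, fmin=75.0, fmax=500.0):
    """Estimate f0 by brute-force autocorrelation: the lag (in samples)
    that maximizes self-similarity corresponds to the fundamental period."""
    lo = int(sr / fmax)   # shortest candidate period in samples
    hi = int(sr / fmin)   # longest candidate period in samples
    best_lag, best_r = lo, float("-inf")
    for lag in range(lo, hi + 1):
        r = sum(signal[i] * signal[i + lag] for i in range(len(signal) - lag))
        if r > best_r:
            best_r, best_lag = r, lag
    return sr / best_lag

# Sanity check on a pure 120 Hz tone (0.1 s at 8 kHz)
sr = 8000
tone = [math.sin(2 * math.pi * 120 * n / sr) for n in range(800)]
print(round(estimate_f0(tone, sr), 1))   # close to 120 Hz
```

Real speech is harder than this pure tone: octave errors (picking a lag at twice the true period), voiceless regions, and creak all require the extra machinery that practical algorithms layer on top.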

What is the difference between intonation and tone?

Intonation refers to pitch patterns at the phrase level that convey pragmatic meaning (questions vs. statements) in all languages. Tone refers to pitch patterns at the word/syllable level that change lexical meaning, as in Mandarin, Thai, and Yoruba.

What does jitter tell us about voice quality?

Jitter is the cycle-to-cycle variation in f₀ period. Normal jitter is below 1%. Values of 1-3% may indicate mild voice strain or aging. Above 3% suggests pathological voice conditions. Jitter, along with shimmer (amplitude variation), is a standard clinical voice assessment metric.

Embed

<iframe src="https://homo-deus.com/lab/speech-science/pitch-tracking/embed" width="100%" height="400" frameborder="0"></iframe>