Question 1

What is UPGMA?

Accepted Answer

UPGMA (Unweighted Pair Group Method with Arithmetic Mean) is a hierarchical clustering algorithm that builds phylogenetic trees from a distance matrix. At each step, it joins the two closest clusters, placing the node at half their distance. UPGMA assumes a molecular clock — equal evolutionary rates across all lineages — producing ultrametric trees where all tips are equidistant from the root.

Question 2

What is a molecular clock?

Accepted Answer

The molecular clock hypothesis proposes that DNA and protein sequences accumulate mutations at a roughly constant rate over time. If true, the number of differences between two sequences is proportional to their divergence time. UPGMA relies on this assumption, while methods like neighbor-joining do not.

Question 3

When does UPGMA fail?

Accepted Answer

UPGMA produces incorrect tree topologies when the molecular clock assumption is violated — that is, when different lineages evolve at different rates. In such cases, fast-evolving lineages are incorrectly placed deeper in the tree. Neighbor-joining, maximum likelihood, and Bayesian methods handle rate variation more robustly.

Question 4

What is long-branch attraction?

Accepted Answer

Long-branch attraction is a systematic error where rapidly evolving lineages are incorrectly grouped together because their sequences have converged on similar compositions by chance. It affects parsimony methods most severely but can also bias distance-based methods when distance corrections are inadequate.

Phylogenetic Tree Simulator: UPGMA Tree Construction

Formula

From Sequences to Trees

The UPGMA Algorithm

The Molecular Clock Assumption

Beyond UPGMA

FAQ

Sources

Embed

Phylogenetic Tree Simulator: UPGMA Tree Construction

Formula

From Sequences to Trees

The UPGMA Algorithm

The Molecular Clock Assumption

Beyond UPGMA

FAQ

Sources

Other simulations: Bioinformatics & Computational Biology

RNA-seq Differential Gene Expression

De Bruijn Graph Genome Assembly

Protein Folding Energy Landscape

Needleman-Wunsch Global Sequence Alignment

Embed