Question 1

What is the Needleman-Wunsch algorithm?

Accepted Answer

The Needleman-Wunsch algorithm (1970) is a dynamic programming method for finding an optimal global alignment between two sequences. It builds a scoring matrix in which cell (i, j) holds the best alignment score for the first i characters of one sequence and the first j characters of the other, then traces back from the bottom-right cell to recover an alignment. It is guaranteed to find the highest-scoring alignment under the chosen scoring scheme, although several alignments may tie for that score.
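The matrix fill and traceback described above can be sketched as follows. This is a minimal illustration, not a reference implementation: the function name and the scoring values (match +1, mismatch -1, linear gap -1) are assumptions chosen for the example, and ties during traceback are broken arbitrarily in favor of the diagonal move.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Return (score, aligned_a, aligned_b) for one optimal global alignment.

    Scoring values are illustrative assumptions, not prescribed defaults.
    """
    m, n = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        score[i][0] = i * gap  # a[:i] aligned against gaps only
    for j in range(1, n + 1):
        score[0][j] = j * gap  # b[:j] aligned against gaps only

    for i in range(1, m + 1):
        for j in range(1, n + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # a[i-1] against a gap
                score[i][j - 1] + gap,      # b[j-1] against a gap
            )

    # Trace back from the bottom-right cell to the top-left cell.
    out_a, out_b = [], []
    i, j = m, n
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return score[m][n], "".join(reversed(out_a)), "".join(reversed(out_b))
```

With these assumed scores, needleman_wunsch("ACGT", "AGT") aligns ACGT against A-GT for a score of 2 (three matches and one gap).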

Question 2

What is the difference between global and local alignment?

Accepted Answer

Global alignment (Needleman-Wunsch) aligns sequences end-to-end, penalizing gaps at both ends. Local alignment (Smith-Waterman) finds the highest-scoring pair of contiguous regions, one from each sequence, and ignores the poorly matching regions outside them. Global alignment is appropriate for homologs of similar length; local alignment is better for finding conserved domains within otherwise divergent sequences.
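To make the contrast concrete, here is a small score-only sketch in which one flag switches between the two recurrences. The function name, the flag, and the scoring values are assumptions for illustration. The local variant differs in three ways: the borders start at zero, every cell is clamped at zero, and the result is the maximum over the whole matrix rather than the bottom-right cell.

```python
def alignment_score(a, b, local=False, match=1, mismatch=-1, gap=-1):
    """Score-only sketch: global (Needleman-Wunsch) or local (Smith-Waterman).

    Scoring values are illustrative assumptions.
    """
    m, n = len(a), len(b)
    score = [[0] * (n + 1) for _ in range(m + 1)]
    if not local:
        # Global alignment pays for gaps at the ends of the sequences.
        for i in range(1, m + 1):
            score[i][0] = i * gap
        for j in range(1, n + 1):
            score[0][j] = j * gap

    best = 0
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            cell = max(
                score[i - 1][j - 1] + sub,
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )
            if local:
                cell = max(cell, 0)  # a local alignment may restart anywhere
            score[i][j] = cell
            best = max(best, cell)

    # Local: best cell anywhere. Global: the bottom-right cell.
    return best if local else score[m][n]
```

For the sequences "XXXACGTXXX" and "YYACGTYY", which share only the core ACGT, the local score is 4 while the global score is -2, because the global alignment must also pay for the unrelated flanks.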

Question 3

How do gap penalties affect alignment?

Accepted Answer

Higher gap penalties (more negative) discourage the algorithm from inserting gaps, forcing more mismatches. Lower gap penalties produce more gapped alignments. Affine gap penalties use separate open and extend costs, so a gap of length k costs open + (k - 1) × extend. This models biological indel patterns more realistically: a single insertion or deletion event often spans several residues, so extending an existing gap should cost less than opening a new one.
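One common way to implement affine gaps is a three-matrix recurrence (often attributed to Gotoh), sketched below for the score only. The function name and the penalty values are assumptions for illustration, and a gap of length k is charged gap_open + (k - 1) * gap_extend.

```python
NEG_INF = float("-inf")


def affine_global_score(a, b, match=1, mismatch=-1, gap_open=-3, gap_extend=-1):
    """Global alignment score with affine gaps (score only, no traceback).

    Penalty values are illustrative assumptions.
    """
    m, n = len(a), len(b)
    # M: alignment ends with a[i-1] paired with b[j-1]
    # X: alignment ends with a[i-1] against a gap
    # Y: alignment ends with b[j-1] against a gap
    M = [[NEG_INF] * (n + 1) for _ in range(m + 1)]
    X = [[NEG_INF] * (n + 1) for _ in range(m + 1)]
    Y = [[NEG_INF] * (n + 1) for _ in range(m + 1)]
    M[0][0] = 0
    for i in range(1, m + 1):
        X[i][0] = gap_open + (i - 1) * gap_extend
    for j in range(1, n + 1):
        Y[0][j] = gap_open + (j - 1) * gap_extend

    for i in range(1, m + 1):
        for j in range(1, n + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            M[i][j] = max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1]) + sub
            X[i][j] = max(
                M[i - 1][j] + gap_open,    # open a new gap
                Y[i - 1][j] + gap_open,    # switch gap to the other sequence
                X[i - 1][j] + gap_extend,  # extend the current gap
            )
            Y[i][j] = max(
                M[i][j - 1] + gap_open,
                X[i][j - 1] + gap_open,
                Y[i][j - 1] + gap_extend,
            )
    return max(M[m][n], X[m][n], Y[m][n])
```

With these assumed penalties, aligning "AAAGGGTTT" with "AAATTT" scores 1: six matches minus one gap of length three, which costs 3 + 1 + 1 = 5. Setting gap_open equal to gap_extend reduces the scheme to a linear gap penalty.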

Question 4

What is the time complexity of Needleman-Wunsch?

Accepted Answer

The algorithm runs in O(mn) time and space, where m and n are the sequence lengths. For long genomic sequences this becomes prohibitive: aligning two sequences of one million bases each means filling 10^12 matrix cells. BLAST and other heuristic methods trade guaranteed optimality for dramatically faster searches, making database searches of millions of sequences practical.
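If only the score is needed, the space cost can be reduced because each row of the matrix depends only on the row above it. The sketch below keeps two rows and also counts the cells it fills, to make the quadratic time cost visible. The function name and scoring values are assumptions for illustration. Recovering the alignment itself in linear space takes more work (Hirschberg's algorithm is the usual approach) and is not shown here.

```python
def nw_score_two_rows(a, b, match=1, mismatch=-1, gap=-1):
    """Return (global_score, cells_filled) using O(min(m, n)) memory.

    Scoring values are illustrative assumptions.
    """
    if len(b) > len(a):
        a, b = b, a  # the score is symmetric; keep rows as short as possible

    prev = [j * gap for j in range(len(b) + 1)]  # row 0 of the matrix
    cells = 0
    for i in range(1, len(a) + 1):
        cur = [i * gap]  # first column: a[:i] against gaps only
        for j in range(1, len(b) + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            cur.append(max(prev[j - 1] + sub, prev[j] + gap, cur[j - 1] + gap))
            cells += 1
        prev = cur  # the older row is no longer needed
    return prev[-1], cells
```

The cell count always equals m × n, so doubling both sequence lengths quadruples the work even though memory use only doubles.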

Needleman-Wunsch Alignment Simulator: Global Sequence Alignment

Formula

The Alignment Problem

Dynamic Programming Solution

Scoring Schemes

From Pairwise to Database Search

FAQ

Sources
