Question 1

What is differential gene expression analysis?

Accepted Answer

Differential expression analysis identifies genes whose mRNA levels differ significantly between two or more conditions (e.g., disease vs. healthy). RNA-seq quantifies transcript abundance by counting the reads that map to each gene. Statistical methods implemented in tools such as DESeq2 and edgeR then identify genes whose changes are larger than expected by chance, while controlling the false discovery rate.
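As a rough illustration of the false discovery rate step, the sketch below applies the Benjamini-Hochberg adjustment to a list of per-gene p-values. The function name and the example p-values are invented for illustration. This is not code from DESeq2 or edgeR, which carry out testing and adjustment internally.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(pvalues)
    # Gene indices sorted from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so the adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values for four genes.
example_pvalues = [0.01, 0.04, 0.03, 0.005]
example_adjusted = benjamini_hochberg(example_pvalues)
significant = [i for i, q in enumerate(example_adjusted) if q < 0.05]
print(example_adjusted, significant)
```

Genes whose adjusted value falls below the chosen threshold (0.05 here) would be reported as differentially expressed at that false discovery rate.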

Question 2

Why is the negative binomial distribution used for RNA-seq?

Accepted Answer

RNA-seq count data show overdispersion: the variance exceeds the mean, whereas a Poisson distribution forces the two to be equal. The negative binomial distribution adds a dispersion parameter that captures this extra variability between biological replicates. Tools like DESeq2 estimate a dispersion for each gene and stabilize those estimates with empirical Bayes shrinkage.
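To make the overdispersion concrete, here is a minimal simulation sketch. It assumes the common parameterization in which the variance equals mean + dispersion × mean², and it draws negative binomial counts as a gamma-Poisson mixture. The mean of 50, the dispersion of 0.2, and the helper names are illustrative choices, not values from any real dataset or tool.

```python
import math
import random
import statistics


def sample_poisson(lam, rng):
    """Knuth's multiplication method; adequate for the small means used here."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def sample_negative_binomial(mean, dispersion, rng):
    """Draw one count from a gamma-Poisson mixture (a negative binomial)."""
    shape = 1.0 / dispersion       # gamma shape
    scale = mean * dispersion      # gamma scale, so the gamma mean equals `mean`
    rate = rng.gammavariate(shape, scale)
    return sample_poisson(rate, rng)


rng = random.Random(0)
mean, dispersion = 50.0, 0.2  # assumed, illustrative values
counts = [sample_negative_binomial(mean, dispersion, rng) for _ in range(20000)]

sample_mean = statistics.fmean(counts)
sample_var = statistics.variance(counts)
expected_var = mean + dispersion * mean ** 2  # a Poisson model would predict `mean`

print(f"mean={sample_mean:.1f} variance={sample_var:.1f} "
      f"NB expectation={expected_var:.1f} Poisson expectation={mean:.1f}")
```

The simulated variance should land far above the mean, which is the pattern a Poisson model cannot reproduce.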

Question 3

What is statistical power in RNA-seq experiments?

Accepted Answer

Statistical power is the probability of correctly detecting a truly differentially expressed gene. It depends on fold change magnitude, sample size, sequencing depth, biological variability (dispersion), and the significance threshold. Low power means many real DE genes go undetected (false negatives).
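The way these factors interact can be sketched with a normal approximation to a Wald test on the log fold change. This is a simplified textbook-style approximation, not the simulator's or any package's actual method. It assumes a single gene with the same mean count in both groups (a stand-in for sequencing depth), a known dispersion, and an unadjusted per-gene significance threshold. The function name and default values are hypothetical.

```python
import math
from statistics import NormalDist


def approx_power(fold_change, replicates, mean_count=100.0,
                 dispersion=0.1, alpha=0.05):
    """Approximate two-sided power for one gene, with `replicates` per group."""
    norm = NormalDist()
    effect = abs(math.log(fold_change))
    # Delta-method variance of a log mean under the negative binomial:
    # (1 / mean_count + dispersion) / replicates per group, two groups.
    se = math.sqrt(2.0 * (1.0 / mean_count + dispersion) / replicates)
    z_crit = norm.inv_cdf(1.0 - alpha / 2.0)
    return norm.cdf(effect / se - z_crit) + norm.cdf(-effect / se - z_crit)


# Power rises with fold change and replicates, and falls with dispersion.
for n in (3, 6, 10):
    print(n, round(approx_power(2.0, n), 3), round(approx_power(1.5, n), 3))
```

Because the threshold here is not corrected for multiple testing, real genome-wide power at a given false discovery rate will generally be lower than this per-gene figure.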

Question 4

How many replicates do I need for RNA-seq?

Accepted Answer

The answer depends on the expected fold change and biological variability. For detecting 2-fold changes with moderate variability, 3 replicates per condition provide roughly 70% power. For small fold changes (1.5×), 6-10 replicates per condition may be needed. Biological replicates (independent samples) are far more valuable than technical replicates (re-sequencing the same library).
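For a back-of-the-envelope estimate, the same kind of normal approximation can be turned into a closed-form replicate count. The sketch below assumes a single gene, a known dispersion, equal group sizes, and an unadjusted significance threshold. The function name and defaults (mean count 100, dispersion 0.1, 80% power) are illustrative assumptions, so treat the result as a starting point rather than a design recommendation.

```python
import math
from statistics import NormalDist


def replicates_needed(fold_change, mean_count=100.0, dispersion=0.1,
                      power=0.8, alpha=0.05):
    """Approximate replicates per group needed to reach the target power.

    `fold_change` must differ from 1, otherwise there is no effect to detect.
    """
    norm = NormalDist()
    z_alpha = norm.inv_cdf(1.0 - alpha / 2.0)
    z_power = norm.inv_cdf(power)
    # Approximate variance of one sample's log expression.
    per_sample_var = 1.0 / mean_count + dispersion
    n = 2.0 * (z_alpha + z_power) ** 2 * per_sample_var / math.log(fold_change) ** 2
    return math.ceil(n)


for fc in (2.0, 1.5):
    print(f"fold change {fc}: about {replicates_needed(fc)} replicates per group")
```

Smaller fold changes or higher dispersion raise the required number quickly, because the fold change enters the formula as a squared logarithm in the denominator.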

RNA-seq Differential Expression Simulator: Statistical Power & Experimental Design

Formula

From Reads to Expression

The Negative Binomial Model

Statistical Testing and Multiple Correction

Experimental Design Matters

FAQ

Sources
