Divide and Classify
A decision tree is one of the most intuitive machine learning models: it mirrors how humans make decisions through a series of questions. Given data with features and labels, the algorithm searches for the feature and threshold whose split yields the purest child groups, typically measured by Gini impurity or entropy. It then repeats recursively on each subset until a stopping criterion is met, such as a depth limit or a pure node. The result is a tree of if-then rules that classifies a new data point by traversing it from root to leaf.
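To make the recursion concrete, here is a minimal from-scratch sketch (a Python/NumPy illustration under the assumptions of numeric features and Gini impurity as the purity measure; names like best_split and grow are hypothetical, not from any library):

```python
import numpy as np

def gini(y):
    # Gini impurity: 1 minus the sum of squared class proportions.
    _, counts = np.unique(y, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def best_split(X, y):
    # Try every feature and candidate threshold; keep the split whose
    # children have the lowest weighted impurity.
    best, best_score = None, gini(y)
    for j in range(X.shape[1]):
        for t in np.unique(X[:, j]):
            left = X[:, j] <= t
            if left.all() or not left.any():
                continue
            score = (left.sum() * gini(y[left]) +
                     (~left).sum() * gini(y[~left])) / len(y)
            if score < best_score:
                best, best_score = (j, t), score
    return best

def grow(X, y, depth=0, max_depth=3):
    # Stop when no split improves purity or the depth limit is reached;
    # a leaf predicts the majority class of its samples.
    split = best_split(X, y) if depth < max_depth else None
    if split is None:
        values, counts = np.unique(y, return_counts=True)
        return {"leaf": values[np.argmax(counts)]}
    j, t = split
    left = X[:, j] <= t
    return {"feature": j, "threshold": t,
            "left": grow(X[left], y[left], depth + 1, max_depth),
            "right": grow(X[~left], y[~left], depth + 1, max_depth)}

def predict_one(node, x):
    # Classify one point by following the if-then rules from root to leaf.
    while "leaf" not in node:
        node = node["left"] if x[node["feature"]] <= node["threshold"] else node["right"]
    return node["leaf"]
```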
Axis-Aligned Partitions
Each split in a standard decision tree tests a single feature against a threshold, creating a boundary that is perpendicular to one feature axis. The simulation shows these rectangular partitions overlaid on the 2D data — you can see how the tree approximates complex boundaries (circles, XOR patterns) through many small rectangular regions. This axis-alignment is both a strength (simple, interpretable) and a limitation (requires many splits for diagonal boundaries).
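One way to see the axis-alignment directly is to print the rules of a fitted tree. The sketch below assumes scikit-learn (not necessarily what the simulation uses) and an XOR-style dataset generated on the fly; every printed rule compares a single feature to a single threshold:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier, export_text

# XOR-style data: class 1 when the two features disagree in sign.
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(200, 2))
y = ((X[:, 0] > 0) != (X[:, 1] > 0)).astype(int)

tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)

# Each rule tests one feature against one threshold, so the learned
# regions are axis-aligned rectangles in the (x0, x1) plane.
print(export_text(tree, feature_names=["x0", "x1"]))
```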
Growing and Pruning
An unpruned tree keeps splitting until every leaf is pure, perfectly classifying the training data but also memorizing its noise. The max_depth parameter caps this growth: shallower trees underfit (too simple) while deeper trees overfit (too complex). The sweet spot depends on the complexity of the data and its noise level. In practice, cross-validation is used to find a good depth, and ensemble methods aggregate many trees to obtain robust predictions.
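Here is a hedged sketch of that depth search, assuming scikit-learn and a synthetic noisy dataset (the exact scores will differ on other data):

```python
from sklearn.datasets import make_moons
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

# Noisy two-moons data: deep enough trees will start memorizing the noise.
X, y = make_moons(n_samples=300, noise=0.3, random_state=0)

# Score each candidate depth with 5-fold cross-validation and keep the
# depth with the best held-out accuracy; None means grow until pure.
for depth in [1, 2, 3, 5, 8, None]:
    scores = cross_val_score(
        DecisionTreeClassifier(max_depth=depth, random_state=0), X, y, cv=5)
    print(f"max_depth={depth}: mean CV accuracy = {scores.mean():.3f}")
```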
From Single Trees to Forests
The power of decision trees was amplified by ensemble methods. Random Forests (Breiman, 2001) train hundreds of trees on bootstrap samples with random feature subsets, then vote on predictions — dramatically reducing overfitting. Gradient Boosting (XGBoost, LightGBM) builds trees sequentially, with each tree correcting the errors of the previous ones. These ensemble methods consistently win machine learning competitions on structured/tabular data.
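As a rough illustration of the ensemble effect, the sketch below (again assuming scikit-learn and synthetic data, so the exact accuracies are not meaningful in themselves) compares one unpruned tree with a 200-tree random forest on the same noisy dataset:

```python
from sklearn.datasets import make_moons
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_moons(n_samples=500, noise=0.3, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# One unpruned tree versus a forest of 200 trees, each fit on a bootstrap
# sample and restricted to a random subset of features at every split.
tree = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)
forest = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_train, y_train)

print("single tree test accuracy:  ", tree.score(X_test, y_test))
print("random forest test accuracy:", forest.score(X_test, y_test))
```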