← Lessons

quiz vs the machine

Gold1360

Machine Learning

Ensemble Methods Overview

Combine many models so their collective prediction beats any single one.

4 min read · core · beat Gold to climb

Ensemble Methods Overview

Ensemble methods combine several models, called base learners, into one stronger predictor. The intuition is that diverse models make different errors, and averaging or voting cancels some of those errors out.

Why ensembles work

  • A single model has limited perspective. Many diverse models capture more of the truth together.
  • Combining predictions tends to reduce variance and sometimes bias.
  • Diversity is essential. Identical models add nothing, so we encourage variety through different data samples, features, or algorithms.

The main families

  • Bagging trains models on different bootstrap samples and averages them to cut variance.
  • Boosting trains models in sequence, each fixing the previous one's mistakes, to cut bias.
  • Stacking trains a meta model to combine the base predictions intelligently.

Tradeoffs

  • Ensembles usually improve accuracy but cost more compute and are harder to interpret.
  • They are workhorses for tabular data, where they frequently top leaderboards.

Key idea

Ensembles combine diverse base learners whose differing errors partly cancel, with bagging, boosting, and stacking offering distinct ways to build a stronger combined predictor.

Check yourself

Answer to earn rating on the learn ladder.

1. Why is diversity among base learners important?

2. Which ensemble family primarily reduces variance?