← Lessons

quiz vs the machine

Gold1350

Machine Learning

Data Augmentation for Images

Expanding image data with label preserving transforms.

5 min read · core · beat Gold to climb

Data Augmentation for Images

Labeled images are scarce and expensive. Data augmentation creates new training examples by applying transformations that change pixels but keep the label correct.

Common transforms

  • Flips and rotations present the object from new orientations.
  • Crops and scaling teach the model that position and size vary.
  • Color jitter alters brightness and contrast for lighting robustness.
  • Noise and blur simulate imperfect cameras.

Why it helps

Augmentation acts as a regularizer. By seeing many variants of the same object, the model learns features that are invariant to those changes rather than memorizing exact pixels. This reduces overfitting and improves generalization to real world variation.

Label preservation

The golden rule is that a transform must not break the label. Flipping a cat horizontally is still a cat, so it is safe. But flipping the digit two might create something that no longer reads as a two, and mirroring text destroys it. Choosing augmentations that respect the task is essential, and applying them randomly each epoch maximizes variety.

Key idea

Image augmentation generates label preserving variants so the model learns invariant features and overfits less.

Check yourself

Answer to earn rating on the learn ladder.

1. Why does augmentation improve generalization?

2. What is the key constraint on an augmentation?