← Lessons

quiz vs the machine

Platinum1780

Machine Learning

The Position Bias Correction

Why top slots get clicked more and how to stop the model believing it.

6 min read · advanced · beat Platinum to climb

The trap in click logs

Items shown at the top of a list get more clicks simply because users look there first. This is position bias: an item at slot one and the same item at slot ten receive very different click rates regardless of true relevance. Training naively on clicks teaches the model that high positions are good, a self reinforcing loop.

Why it matters

  • The model learns to favor whatever was already ranked high, not what is truly best.
  • New or buried good items never get a fair chance to prove themselves.
  • Offline metrics look fine while the system slowly ossifies.

Modeling the bias

A common fix is the examination hypothesis: the chance of a click equals the chance the user examined the position times the chance the item is relevant. If examination depends only on position, you can separate the two.

Correction techniques

  • Inverse propensity weighting reweights each logged click by one over its position examination probability, removing the slot advantage.
  • Position as a feature feeds the slot into training, then sets it to a neutral value at serving so the model uses relevance, not position.
  • Randomization occasionally shuffles results to estimate examination curves directly.

The payoff

Correcting position bias lets the ranker learn true relevance, surfacing items that were unfairly buried and breaking the feedback loop.

Key idea

Position bias makes top slots look better than they are; modeling examination and using inverse propensity weighting separates true relevance from the slot advantage.

Check yourself

Answer to earn rating on the learn ladder.

1. What is position bias in click logs?

2. How does inverse propensity weighting correct position bias?

3. Why is uncorrected position bias a self reinforcing problem?