← Lessons

quiz vs the machine

Gold1380

System Design

Document Ranking Signals

The families of signals that decide which results rank highest.

5 min read · core · beat Gold to climb

What ranking decides

Retrieval finds candidates; ranking orders them. The order comes from combining many signals into a single score.

Signal families

  • Textual relevance measures how well query terms match a document, often via BM25.
  • Quality and authority capture how trustworthy or popular a document is, like link counts or domain reputation.
  • Freshness rewards recent content for time sensitive queries.
  • User behavior uses clicks and dwell time as feedback on what people prefer.
  • Context includes location, language, and device.

Combining signals

Early systems used hand tuned weights. Modern systems learn the combination from labeled data, a technique called learning to rank. The model takes signal values as features and predicts a relevance score.

Avoiding over weighting

No single signal should dominate. A page stuffed with query terms but low quality should not beat a trusted, relevant page. Balancing signals is the core ranking craft.

Diagram

Key idea

Ranking blends textual, quality, freshness, and behavior signals into one score, increasingly learned rather than hand tuned.

Check yourself

Answer to earn rating on the learn ladder.

1. What does learning to rank replace?

2. Why should no single signal dominate ranking?