Follower Graph Storage

The graph behind the feed

Every feed operation needs the follow graph, the set of directed edges where one user follows another. Fan out needs the followers of an author. Feed assembly needs the accounts a user follows. Both must be fast.

Two views of the same edge

A follow is one edge but is read in two directions, so systems store both:

A followers list indexed by author, used at post time to find who to fan out to.
A following list indexed by user, used at read time to know whose posts to gather.

Keeping both indexes means a new follow writes to two places, but each query reads exactly one.

Scaling the graph

The graph is huge and skewed, so it is partitioned, often by user id.
Edges are simple, so a key value or wide column store handles billions of them well.
Hot accounts with millions of followers may need their follower list sharded across partitions to avoid a single hot key.

Why a real graph database is rare here

Feeds mostly need one hop, the direct neighbors, not deep traversals. A partitioned key value design serves that one hop pattern faster and cheaper than a general graph engine.

Key idea