Limited Time Offer:Up to 0% off Hello Interview Premium

Up to 0% off Hello Interview Premium 🎉

⌘K

Pricing

Tutor

Get Premium

Full Article

Quick Reference

Sharding

Shard Decision

When To Shard

Prove the bottleneck

Identify storage, write, or read limits before proposing sharding.

Check storage limits

A single database eventually caps out; Amazon Aurora maxes around 256 TiB.

Check throughput limits

50K writes/s or 100M DAU with many queries can justify distributing load.

Avoid premature sharding

Sharding adds shard keys, routing, hotspots, rebalancing, and consistency work.

Split Types

Partitioning splits data within one database instance to improve scans and maintenance, and Sharding splits data across multiple machines to scale storage and read/write throughput.

Shard Key

Pick high cardinality

user_id gives millions of values; is_premium gives only two.

Prefer even distribution

Avoid country if 90% of users are in the US.

Match query patterns

Common reads and writes should hit one shard, such as a user's data by user_id.

Avoid time-only keys

created_at makes all new writes hit the latest shard.

Distribution

Shard Assignment

Hash-Based Sharding is the default for most designs and evenly distributes keys but needs a resharding plan, Range-Based Sharding is simple and supports range scans but skewed ranges create hotspots, and Directory-Based Sharding is flexible for moving hot users but adds a lookup and critical dependency.

Resharding Methods

Consistent hashing minimizes data movement when adding or removing shards, and Simple modulo remaps almost every record when hash key modulo N changes from 4 to 5.

Sharding Pitfalls

Hot Spots

Celebrity problem means a hot user can drive 1000x more traffic to one shard and should be isolated, Compound shard keys such as hash user_id plus date spread one hot user's data over time, and Dynamic shard splitting lets the MongoDB balancer split and migrate chunks to maintain balance.

Fan-Out Reads

Cache results stores global results for 5 minutes when real-time accuracy is not required, Denormalize data duplicates related data onto one shard for common reads, Precompute globals uses background jobs for trending content, and Accept rare fan-out allows infrequent admin totals to query and aggregate all shards.

Distributed Writes

Avoid cross-shard transactions is the best solution by keeping all of a user's data on one shard, Saga pattern coordinates multiple shards with independent steps and compensating actions, Two-phase commit guarantees consistency but is slow and fragile, and Eventual consistency works for denormalized counts that can briefly differ and converge.

Managed Sharding

Cassandra uses a partitioner with virtual nodes to map partition keys to token ranges, DynamoDB hashes partition keys to internal partitions and splits or merges as they grow, MongoDB uses range-based chunks with hashed shard keys as ranges over hash space, and Vitess and Citus are SQL sharding layers for MySQL or PostgreSQL that handle routing and resharding.

Your account is free and you can post anonymously if you choose.

Reading Progress

On This Page

Shard Decision

When To Shard

Split Types

Shard Key

Distribution

Shard Assignment

Resharding Methods

Sharding Pitfalls

Hot Spots

Fan-Out Reads

Distributed Writes

Managed Sharding

Sharding

Shard Decision

When To Shard

Prove the bottleneck

Check storage limits

Check throughput limits

Avoid premature sharding

Split Types

Shard Key

Pick high cardinality

Prefer even distribution

Match query patterns

Avoid time-only keys

Distribution

Shard Assignment

Resharding Methods

Sharding Pitfalls

Hot Spots

Fan-Out Reads

Distributed Writes

Managed Sharding

Comments

Questions

Learn

Links

Legal

Contact