CDN (Content Delivery Network)
Learn how a CDN routes users to the nearest edge server, cuts global latency from 300ms to under 30ms, offloads 95%+ of traffic from your origin, and when you actually need one.
TL;DR
- A CDN (Content Delivery Network) is a globally distributed network of edge servers, called Points of Presence (PoPs), that cache copies of your content as close to users as possible. A request from Sydney to a New York origin takes ~300ms round-trip. To a Sydney PoP, it takes ~5ms.
- Without a CDN, every request for a static file (your JavaScript bundle, your hero image, your video) travels all the way to your single origin server. Every byte traverses the public internet. At any kind of global scale, this is the first performance bottleneck that kills user experience.
- At a 95% edge cache hit rate, your origin receives 20× less traffic. The same compounding math as application-level caching, applied to the outermost layer of your stack.
- Static content (CSS, JS, images, fonts, video) should be CDN-served by default. Dynamic content (API responses, personalised pages) can still benefit from TCP acceleration and TLS termination at edge even when it can't be cached.
- The hardest CDN problem is cache invalidation: when you update your JS bundle, every PoP worldwide must serve the new version. The two solutions are time-based TTL (simple, eventually consistent) and content-addressable URLs (instant, requires build tooling).
The Problem It Solves
It's launch day. Your team spent three months building a news app. TechCrunch runs a front-page feature. Traffic spikes globally: engineers in Berlin, journalists in Tokyo, readers in São Paulo all hit the "read" button at once.
Your single origin server sits in US-East-1, Virginia. For a user in Sydney, a TCP connection alone takes ~160ms (speed of light across the Pacific). Add TLS handshake (~80ms), HTTP request (~40ms), response transfer (~80ms for a 500KB JS bundle): first contentful paint arrives in 360ms.
That's before the app even renders. I often see candidates gloss over this number in system design interviews; the latency budget is already spent before the framework renders a single pixel. Google's Core Web Vitals mark anything over 2.5 seconds LCP as "poor," and you're spending a third of that budget on pure network transit.
Meanwhile, your origin is receiving every image request, every font file, every versioned JavaScript bundle, from every user worldwide, simultaneously. Your bandwidth bill is calculated on egress bytes from your cloud provider at $0.09/GB. A 2MB average page weight × 50,000 concurrent users = 100GB egress in the first hour.
$9 for one hour of page loads, before your database or compute costs. And your single origin server becomes a single point of failure: one garbage collection pause or DB connection spike takes down users globally.
A single origin serving a global audience is a problem you cannot solve by adding more app servers.
The false assumption in 'just add more app servers'
Horizontal scaling adds compute capacity, but it doesn't reduce the network distance between your origin and your users. A user in Mumbai is still 200ms away from your Virginia load balancer whether you run 2 app servers or 20. Adding instances behind your load balancer does nothing for the latency that lives in the physical wire.
```mermaid
flowchart TD
    subgraph Globe["Global Users - All Roads Lead to US-East-1"]
        UserSYD(["Sydney User\n~300ms RTT\n18 hops"])
        UserBER(["Berlin User\n~170ms RTT\n14 hops"])
        UserMUM(["Mumbai User\n~200ms RTT\n16 hops"])
    end
    subgraph Origin["Single Origin - All Traffic Concentrated Here"]
        LB["Load Balancer\nUS-East-1"]
        AS["App Servers\n(serving static + dynamic)"]
        DB[("Database\nAll reads → here too")]
    end
    UserSYD -->|"300ms RTT · every asset request"| LB
    UserBER -->|"170ms RTT · every asset request"| LB
    UserMUM -->|"200ms RTT · every asset request"| LB
    LB --> AS
    AS --> DB
```
Without a CDN, every asset download, including your 800KB gzipped JavaScript bundle, makes the full intercontinental round trip. Static files that never change between requests are served fresh from origin, on every request, to every user.
What Is It?
A CDN is a globally distributed caching layer that sits between your users and your origin server. It is a network of Points of Presence (PoPs), data centers on every continent, that intercept requests for your content.
When a user in Sydney requests your JavaScript bundle, the CDN routes them to the nearest Sydney PoP. The PoP serves the file from its local cache in under 10ms. Your origin in Virginia never sees the request.
My recommendation: treat the CDN as mandatory infrastructure for any app with global users, not an optional performance enhancement.
Analogy: Think of how Amazon distributes inventory. Before Amazon Fulfillment Centers existed, every order shipped from one central warehouse. A customer in Los Angeles waited 5 days for a book warehoused in Hoboken, New Jersey.
Amazon's insight was to stock products in fulfillment centers close to customers, so an LA order ships from a downtown LA warehouse and arrives tomorrow. A CDN does exactly this for your digital content. Your origin is the central warehouse, PoPs are the fulfillment centers, and a cache hit is same-day delivery.
Cache misses, the rare case where the local fulfillment center is out of stock, still require a trip to the central warehouse, but at 5% frequency they don't define the user experience.
```mermaid
flowchart TD
    subgraph Globe["Global Traffic - Routed to Nearest Edge"]
        UserSYD(["Sydney User"])
        UserBER(["Berlin User"])
        UserMUM(["Mumbai User"])
    end
    subgraph CDNTier["CDN Edge - 11 PoPs Worldwide"]
        PopSYD["OC-SYD PoP\n< 5ms RTT\n~95% HIT rate"]
        PopFRA["EU-FRA PoP\n< 15ms RTT\n~95% HIT rate"]
        PopSIN["AS-SIN PoP\n< 10ms RTT\n~95% HIT rate"]
    end
    subgraph Origin["Origin - Serves Only Cache Misses (~5%)"]
        LB["Load Balancer"]
        AS["App Servers"]
        DB[("Database")]
    end
    UserSYD -->|"5ms · typically HIT"| PopSYD
    UserBER -->|"12ms · typically HIT"| PopFRA
    UserMUM -->|"8ms · HIT, or miss to origin"| PopSIN
    PopSYD -.->|"cache miss only · 5%"| LB
    PopFRA -.->|"cache miss only · 5%"| LB
    PopSIN -.->|"cache miss only · 5%"| LB
    LB --> AS
    AS --> DB
```
The CDN edge layer absorbs 95%+ of global traffic. Your origin infrastructure, previously the bottleneck for every user worldwide, now only handles cache misses, writes, and dynamic requests. The same origin that struggled under global load now runs comfortably.
How It Works
Here is what happens, step by step, when a user in Tokyo requests your application's JavaScript bundle for the first time, and every time after:
Step 1: DNS routes the user to the nearest PoP
The CDN replaced your origin's DNS record with a CDN-managed record. When the user's browser resolves assets.yourapp.com, the CDN's authoritative DNS returns the IP address of the geographically nearest PoP.
For a user in Tokyo, that's the AS-NRT edge node. I find most engineers understand CDN caching intuitively but underestimate this DNS routing layer; it's what makes "nearest PoP" actually mean something. This routing uses one of two mechanisms:
- GeoDNS: the DNS server inspects the client's IP address and returns the PoP IP in the closest geographic region.
- Anycast: multiple PoPs share the same IP address. BGP routing automatically delivers packets to the topologically nearest node. This is how Cloudflare operates, and it provides automatic failover if a PoP goes down.
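To make the GeoDNS branch concrete, here is a minimal TypeScript sketch. The PoP list, IP addresses, and region names are all hypothetical; a real resolver first maps the client's IP to a location via a GeoIP database before picking a PoP:

```typescript
// Hypothetical PoP catalogue: region -> edge node. A real GeoDNS service
// resolves the client's IP to a location via a GeoIP database first.
const POPS: Record<string, { city: string; ip: string }> = {
  oceania: { city: 'Sydney', ip: '203.0.113.10' },
  europe: { city: 'Frankfurt', ip: '203.0.113.20' },
  asia: { city: 'Singapore', ip: '203.0.113.30' },
};

const DEFAULT_REGION = 'europe'; // fallback when the client's region is unknown

// GeoDNS in one function: given the client's region, answer the DNS query
// for assets.yourapp.com with the nearest PoP's IP instead of the origin's.
function resolvePop(clientRegion: string): string {
  const pop = POPS[clientRegion] ?? POPS[DEFAULT_REGION];
  return pop.ip;
}
```

Anycast needs no such lookup table: every PoP announces the same IP prefix, and BGP routing delivers each packet to the topologically nearest node.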
Step 2: PoP checks its local cache
The PoP looks up the request path (/static/app.7f3c2a1b.js) in its local cache. It checks whether a cached copy exists and whether it's still within its TTL.
Cache HIT (95% of requests): The PoP serves the cached file directly. Latency: 5–30ms depending on distance to the PoP. The origin server is never contacted. This is the steady-state operating mode.
Cache MISS (first request or TTL expired): The PoP doesn't have a fresh copy. It must fetch from origin.
Step 3: Cache miss - the PoP fetches from origin
The PoP opens a TCP connection to your origin and fetches the file. This incurs full origin latency (150–300ms from Tokyo to Virginia). Once fetched, the PoP caches the response per the Cache-Control header your origin sends, then serves the response to the waiting user.
Step 4: All subsequent requests - cache HIT
Every user in the Tokyo region who requests the same file now gets the cached version from AS-NRT. The origin never sees them. This holds until the TTL expires or you explicitly purge the PoP's cached copy.
At steady state, the CDN is invisible to your origin, and that's exactly the point.
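The four steps above can be condensed into a toy pull-CDN PoP. This is a sketch, not a real edge server: the origin fetcher is injected, and the hard-coded 60-second TTL stands in for whatever `s-maxage` the origin would actually send:

```typescript
// Minimal sketch of a pull-CDN PoP's cache decision (steps 2-4 above).
type Entry = { body: string; storedAt: number; ttlSeconds: number };

class PopCache {
  private cache = new Map<string, Entry>();
  public originFetches = 0;

  // fetchOrigin is injected so the sketch stays self-contained
  constructor(private fetchOrigin: (path: string) => string) {}

  get(path: string, now: number): { body: string; status: 'HIT' | 'MISS' } {
    const entry = this.cache.get(path);
    // HIT only if a copy exists and it is still within its TTL
    if (entry && now - entry.storedAt < entry.ttlSeconds * 1000) {
      return { body: entry.body, status: 'HIT' };
    }
    // MISS: fetch from origin, cache it (assumed s-maxage=60), then serve
    this.originFetches++;
    const body = this.fetchOrigin(path);
    this.cache.set(path, { body, storedAt: now, ttlSeconds: 60 });
    return { body, status: 'MISS' };
  }
}
```

The first request is a MISS and warms the cache; everything inside the TTL window is a HIT; the first request after expiry goes back to origin.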
```typescript
import type { Response, Request, NextFunction } from 'express';

// Production Cache-Control strategy, set on your origin server.
// These headers tell the CDN how long to cache each type of content.
export function cacheControlMiddleware(req: Request, res: Response, next: NextFunction) {
  const path = req.path;

  // Hashed static assets: filename contains a content hash (webpack, Vite),
  // e.g. /static/app.7f3c2a1b.js -- the hash changes on every build.
  if (/\/static\/.*\.[0-9a-f]{6,}\.(js|css|woff2?|png|webp|svg)$/.test(path)) {
    // max-age=31536000: browsers cache for 1 year
    // immutable: browser won't send a conditional request (If-None-Match), so no round-trip
    res.setHeader('Cache-Control', 'public, max-age=31536000, immutable');
    return next();
  }

  // Non-hashed static files (robots.txt, favicon.ico, sitemap.xml)
  if (/\.(ico|txt|xml)$/.test(path)) {
    // s-maxage=86400: CDN caches for 24h (overrides max-age for the CDN only)
    // stale-while-revalidate=3600: CDN serves stale for 1h while fetching fresh
    res.setHeader('Cache-Control', 'public, max-age=3600, s-maxage=86400, stale-while-revalidate=3600');
    return next();
  }

  // Cacheable API responses: CDN caches, browsers don't.
  // e.g. trending feed, public product catalog, config endpoint
  if (path.startsWith('/api/public/')) {
    // max-age=0: browsers always re-request (they'd see stale data immediately otherwise)
    // s-maxage=60: CDN serves this for 60s without re-fetching origin
    // stale-while-revalidate=300: CDN serves stale for up to 5 min while refreshing in the background
    res.setHeader(
      'Cache-Control',
      'public, max-age=0, s-maxage=60, stale-while-revalidate=300'
    );
    return next();
  }

  // Private/authenticated content: must never be cached at the CDN layer
  res.setHeader('Cache-Control', 'private, no-store');
  next();
}
```
Interview tip: cite the s-maxage vs max-age distinction
The difference between `max-age` and `s-maxage` is a signal most engineers miss. `max-age` controls browser caching. `s-maxage` controls shared caches, specifically your CDN. Setting `s-maxage=60, max-age=0` lets the CDN serve a cached response for 60 seconds while forcing browsers to always check. This is the correct pattern for public API endpoints where you want CDN acceleration but not browser caching.
Cache-Control header reference
| Directive | Scope | What it does |
|---|---|---|
| `public` | CDN + browser | Content is safe to cache by any intermediate cache |
| `private` | Browser only | Only the end user's browser may cache; the CDN must not |
| `max-age=N` | Browser | Browser uses the cached copy for N seconds without re-requesting |
| `s-maxage=N` | CDN only | CDN uses the cached copy for N seconds (overrides `max-age` for CDNs) |
| `no-cache` | Both | Must revalidate with origin before serving (conditional GET with ETag) |
| `no-store` | Both | Never cache; don't write to disk or memory at all |
| `immutable` | Browser | Never send a conditional request; content identified by the URL is permanent |
| `stale-while-revalidate=N` | CDN (RFC 5861) | Serve stale for N seconds while refreshing in the background (no user waits) |

`s-maxage` is the single header value you control that determines whether your CDN is actually doing its job.
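The precedence rules in the table can be expressed as a small, illustrative parser (per RFC 9111, a shared cache prefers `s-maxage` over `max-age`, and `private`/`no-store` forbid shared caching entirely). Real CDNs implement far more of the spec; this shows only the TTL-selection logic:

```typescript
// Split a Cache-Control header into directive -> value pairs,
// e.g. "public, s-maxage=60" -> { public: null, "s-maxage": "60" }
function parseDirectives(header: string): Map<string, string | null> {
  const out = new Map<string, string | null>();
  for (const part of header.split(',')) {
    const [k, v] = part.trim().split('=');
    out.set(k.toLowerCase(), v ?? null);
  }
  return out;
}

// How long (in seconds) may a shared cache (CDN) hold this response?
function cdnTtlSeconds(header: string): number {
  const d = parseDirectives(header);
  if (d.has('no-store') || d.has('private')) return 0; // CDN must not cache
  if (d.has('s-maxage')) return Number(d.get('s-maxage')); // CDN-specific TTL wins
  if (d.has('max-age')) return Number(d.get('max-age')); // fall back to shared TTL
  return 0;
}
```

With `public, max-age=0, s-maxage=60`, the CDN caches for 60 seconds even though browsers re-request every time, which is exactly the public-API pattern above.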
Key Components
| Component | Role |
|---|---|
| PoP (Point of Presence) | A CDN data center in a specific city. Stores cached copies of your content. Routes traffic from users in the surrounding region. Major CDNs have 200–300+ PoPs. |
| Origin server | Your actual application server. The authoritative source. The CDN fetches from here on cache misses and for all non-cacheable content. |
| Cache-Control header | The HTTP response header from your origin that tells PoPs how long to cache, who can cache, and what to do with stale entries. The primary lever you have over CDN behaviour. |
| TTL (Time-To-Live) | The duration a PoP holds a cached response before considering it stale and re-fetching. Determined by s-maxage or max-age in your Cache-Control header. |
| CDN DNS / Anycast | The routing layer that maps a user's request to the geographically or topologically nearest PoP. GeoDNS uses client IP geolocation; Anycast uses BGP routing. |
| CDN Purge API | An API your deployment pipeline calls to invalidate specific cached paths or tags across all PoPs immediately, without waiting for TTL expiry. Critical for emergency rollbacks. |
| Origin shield | An optional intermediate caching tier between PoPs and your origin. When 10 PoPs all miss simultaneously, instead of 10 requests hitting your origin, they all converge on one "shield" node that makes a single request. Reduces origin fan-out. |
| TLS termination | The CDN handles the TLS handshake at the PoP, close to the user. This saves 80–100ms of TLS negotiation latency over terminating TLS at your distant origin. |
| ETag / If-None-Match | Conditional request headers. The CDN (or browser) sends the ETag of its cached copy; the origin returns 304 Not Modified if content hasn't changed, saving bandwidth on the response body. |
| Edge functions | Serverless code that runs at the CDN PoP, e.g. Cloudflare Workers or Vercel Edge Runtime. Can dynamically modify responses, handle auth, and rewrite URLs without origin round-trips. |
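The origin-shield row deserves a sketch, because the underlying mechanism, request coalescing (often called single-flight), is simple to express: concurrent misses for the same key share one in-flight origin fetch. The class below is illustrative, not any CDN's actual implementation:

```typescript
// Origin shield in miniature: many simultaneous cache misses for the same
// key are coalesced into a single origin fetch.
class Shield {
  private inFlight = new Map<string, Promise<string>>();
  public originFetches = 0;

  constructor(private fetchOrigin: (key: string) => Promise<string>) {}

  get(key: string): Promise<string> {
    const pending = this.inFlight.get(key);
    if (pending) return pending; // piggyback on the fetch already in flight
    const p = this.fetchOrigin(key).finally(() => this.inFlight.delete(key));
    this.inFlight.set(key, p);
    this.originFetches++;
    return p;
  }
}
```

Two hundred simultaneous PoP misses for `/app.js` produce exactly one origin fetch; the other 199 callers await the same promise.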
Types / Variations
Push CDN vs Pull CDN
The two models differ in who initiates the content transfer to PoPs.
| Dimension | Push CDN | Pull CDN |
|---|---|---|
| How PoPs are populated | You upload content via CDN API at deploy time | CDN fetches from origin on first cache miss, per-PoP |
| First-request latency | Always HIT: content pre-populated | MISS on first request to any PoP: origin latency |
| Origin load | Zero reads after push completes | Origin sees 1 request per PoP per cache miss |
| Management overhead | High: must push on every content update | Low: CDN auto-manages; just set TTL headers |
| Storage cost | Pays for content at every PoP regardless of demand | Only caches content that gets requested |
| Cache invalidation | Delete via CDN API, re-push new version | Wait for TTL or call purge API |
| Best for | Large binaries, video, infrequently changing content | Websites, APIs, dynamic traffic patterns |
Most modern CDNs (Cloudflare, Fastly, CloudFront) operate as pull CDNs by default. Push CDN semantics are used by specialised video delivery networks (Netflix Open Connect) and asset pipelines where guaranteed warmth is required.
Reverse Proxy CDN vs Origin CDN
- Reverse proxy CDN (Cloudflare, Fastly, Akamai): sits in front of your entire origin. All traffic flows through the CDN; it also provides WAF, DDoS protection, and edge compute. Your origin's IP is hidden from the public internet.
- Origin CDN / object storage CDN (Amazon CloudFront + S3, GCS + Cloud CDN): your static assets live in object storage; the CDN fronts only the storage bucket. Your application origin is separate and not proxied by the CDN.
CDN is not just for static files
Even for dynamic requests that can't be cached, a reverse proxy CDN still reduces latency via TCP connection reuse and TLS termination at the edge. Cloudflare maintains persistent TCP connections to your origin. A user in Tokyo who makes a non-cacheable API request still saves 80–100ms compared to establishing a fresh TCP+TLS connection all the way to your origin. This is why "put everything behind Cloudflare" is commonly good advice even for dynamic applications.
Cache Invalidation
Cache invalidation at CDN scale is harder than at application-cache scale, because you're invalidating potentially 200+ PoPs simultaneously. The standard tools are:
TTL expiry (default, lazy)
Set a short enough s-maxage that stale content expires before it matters. For content updated daily, s-maxage=3600 (1 hour) is usually acceptable.
Users in the worst case see 60-minute-old content. Simple, zero operational overhead, no purge API calls required.
The problem: If you have a production incident and need to push a fix now, you can't wait 60 minutes for TTL to expire. My recommendation here is to always pair TTL caching with a purge pipeline; relying on TTL expiry alone is the mistake I see teams make right before their first 2am emergency rollback. TTL alone is insufficient for emergency deployments.
Content-addressable URLs (best practice for static assets)
Embed a content hash in every static asset filename. The file /static/app.7f3c2a1b.js has the hash 7f3c2a1b derived from the file's contents. When you rebuild, the hash changes: /static/app.d3e9f1a0.js.
The new URL is unconditionally a CDN miss, since PoPs don't have it yet. The old URL remains valid for users who haven't reloaded (zero breaking change).
```typescript
// next.config.mjs: Next.js generates content-hashed asset URLs automatically.
// Output: /_next/static/chunks/app.7f3c2a1b.js
// No manual cache-busting needed; the framework handles it.

// For your own build pipeline (e.g. a custom esbuild setup):
import { createHash } from 'node:crypto';
import { readFileSync } from 'node:fs';

function contentHash(filePath: string): string {
  const content = readFileSync(filePath);
  return createHash('sha256').update(content).digest('hex').slice(0, 8);
}

// assets/app.js -> assets/app.7f3c2a1b.js
// Cache-Control: public, max-age=31536000, immutable -> cache forever, no purge needed
```
This is the correct default for all JavaScript, CSS, fonts, and images. Your CDN can cache these files forever (max-age=31536000, immutable) because the URL changes whenever the content changes.
If your build pipeline doesn't produce content-hashed filenames, fix that before you tune any other CDN setting.
CDN Purge API (emergency invalidation)
Every major CDN provides an API to invalidate cached paths across all PoPs. This is what you call in your deployment pipeline for content that doesn't use content-addressable URLs.
```shell
# Cloudflare: purge a specific file after deployment
curl -X POST "https://api.cloudflare.com/client/v4/zones/${ZONE_ID}/purge_cache" \
  -H "Authorization: Bearer ${CF_API_TOKEN}" \
  -H "Content-Type: application/json" \
  --data '{"files":["https://yourapp.com/index.html","https://yourapp.com/api/config"]}'

# Cloudflare: purge by cache tag (requires Enterprise plan; much more surgical)
curl -X POST "https://api.cloudflare.com/client/v4/zones/${ZONE_ID}/purge_cache" \
  -H "Authorization: Bearer ${CF_API_TOKEN}" \
  -H "Content-Type: application/json" \
  --data '{"tags":["product-catalog","blog-posts"]}'
```
Purge API propagation is not instantaneous
When you call the purge API, propagation across all PoPs worldwide typically takes 1–5 seconds for Cloudflare and 30–60 seconds for CloudFront. During that window, some users will still receive the old cached version. For a rollback scenario this is fine; the window is short. For compliance-critical content removal (GDPR deletion requests), track propagation and verify completion before confirming deletion.
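For the compliance case, "track propagation and verify completion" can be as simple as polling until the purged object is confirmed gone. The sketch below is generic: `isPurged` is a hypothetical injected check that would, in practice, fetch the URL through the CDN and inspect the response:

```typescript
// Poll an injected purge check until it succeeds or attempts run out.
// Returns true once the purge is confirmed, false if the window elapses.
async function waitForPurge(
  isPurged: () => Promise<boolean>,
  attempts = 10,
  delayMs = 500,
): Promise<boolean> {
  for (let i = 0; i < attempts; i++) {
    if (await isPurged()) return true;
    await new Promise((r) => setTimeout(r, delayMs)); // back off between checks
  }
  return false; // escalate: purge did not propagate within the window
}
```

A deploy pipeline would call this after the purge API request and fail the deletion step, rather than silently succeed, when it returns false.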
Use content-hashed URLs for everything static, TTL for semi-static content, and the purge API as your emergency brake; anything else is improvising in production.
Trade-offs
| Pros | Cons |
|---|---|
| Drastic latency reduction for global users: 200–300ms cross-continent trips become 5–30ms PoP hops | Cache invalidation complexity: stale content across 200+ PoPs requires pipeline discipline (content-addressable URLs or a purge API) |
| Origin offload: 95%+ of read traffic absorbed at the edge; origin infrastructure can be sized for 20× less load | Additional failure surface: CDN misconfiguration can serve stale content globally, or block all traffic if rules are wrong |
| Bandwidth cost reduction: CDN egress is typically cheaper than cloud provider egress ($0.01–0.04/GB vs $0.08–0.09/GB) | No benefit for non-cacheable dynamic content: personalized pages, authenticated API responses, and real-time data bypass the cache entirely |
| DDoS protection: the CDN absorbs volumetric attacks at the edge (Cloudflare: 154Tbps network capacity) before traffic reaches origin | Vendor dependency: the CDN becomes critical infrastructure; an outage or price change has immediate production impact |
| Improved availability: PoP redundancy isolates regional failures; if your origin has a brownout, the CDN can serve stale content via `stale-if-error` | Debugging difficulty: cache hits at the edge hide origin errors; `cf-cache-status: HIT` in headers means the user isn't seeing what your origin would send today |
| TLS termination at the edge: users get fast TLS handshakes even when the origin is distant | `Vary` header complexity: content negotiation (Accept-Encoding, Accept-Language) creates multiple cache variants per URL; a misconfigured `Vary` bloats CDN storage |
The fundamental tension here is performance vs. consistency. A CDN is explicitly a caching layer between users and your authoritative data. Every performance gain, every cache hit, is served from a copy that might be milliseconds to hours behind the current state at origin. The engineering challenge is choosing, per content type, how stale is acceptable, and building the expiry or invalidation mechanics to enforce that bound.
When to Use It / When to Avoid It
So when does this actually matter? My recommendation is to start from "always add a CDN" and then carve out disciplined exceptions: for any app with static content and global users, a CDN is table stakes.
Use a CDN when:
- You serve users in multiple geographic regions and care about first-contentful-paint for each.
- Your application has a significant fraction of static or semi-static content (landing pages, documentation, product images, video).
- You're running on a cloud provider with expensive egress costs; CDN egress is consistently 4–10× cheaper.
- You need DDoS protection without deploying and maintaining your own scrubbing infrastructure.
- Your origin would be exposed directly to the public internet; a reverse proxy CDN hides your origin's IP, significantly raising the cost of targeted attacks.
- Your p95 response time for international users is significantly worse than for users co-located with your origin.
If any two of these conditions apply, the CDN pays for itself, usually inside the first billing cycle.
Avoid (or carefully scope) a CDN when:
- Content is personalised per user: user session data, account pages, shopping carts. These must be `Cache-Control: private, no-store` and will not benefit from CDN caching at all (though TLS termination still helps latency).
- You're in the prototype stage. A CDN adds an operational layer that makes debugging harder (HTTP headers, purge pipelines, origin shield config). Measure first; add a CDN when latency data justifies it.
- You're serving real-time data (live sports scores, stock ticks) where any staleness is user-visible. TTL-based caching at any layer, including the CDN, is actively harmful here.
- Compliance requires absolute certainty that deleted content is gone. CDN caches create a propagation window. For GDPR right-to-erasure scenarios, you need to track and confirm purge completion, not just fire-and-forget.
Put a CDN in front of everything by default, then explicitly opt out for the exceptions.
Real-World Examples
Netflix โ Open Connect (Push CDN with hardware appliances)
Netflix delivers 15% of all downstream internet bandwidth globally. They run their own CDN called Open Connect, purpose-built for video delivery, deployed as physical hardware appliances co-located directly inside ISPs and internet exchange points (IXPs). These appliances pre-load the most popular titles for that ISP/region overnight (a push CDN model).
When a subscriber in Brisbane presses play on a popular title, the video stream comes from an Open Connect appliance inside Telstra's network in Brisbane, not from AWS US-East. The appliance saw that the title was popular in Brisbane, fetched it overnight, and it's sitting on the local SSD at full quality. No round-trip to the AWS origin happens at all.
Netflix reports that 95%+ of their streamed bits are served from Open Connect. The lesson: at Netflix's scale, building a proprietary push CDN tuned for one specific content type (large, sequential video files) outperforms generic pull CDNs.
Cloudflare โ Reverse Proxy CDN as Infrastructure
Cloudflare operates one of the largest networks in the world (200+ PoPs, 154Tbps of network capacity) and positions its CDN as a reverse proxy that handles not just caching but also DDoS mitigation, WAF, bot management, and edge compute (Cloudflare Workers). In 2022, Cloudflare mitigated a 26 million requests-per-second DDoS attack, the largest ever at the time, without the target origin experiencing any degradation. The attack was fully absorbed at the edge. This is the non-obvious value of a CDN: the capacity margin baked into a network like Cloudflare's means even unprecedented volumetric attacks are absorbed before they reach you. If you're self-hosting, reproducing this would require tens of thousands of servers purely for absorbing attack traffic.
GitHub โ CDN for release archives and raw content
GitHub serves hundreds of millions of requests per day for repository raw content, release archives, and avatar images. Raw file content at raw.githubusercontent.com is served through Fastly. Release .tar.gz and .zip archives (which can be gigabytes for large projects) are served through Azure CDN.
Without a CDN, a globally popular open-source project's first release would instantly saturate GitHub's origin bandwidth as package managers worldwide simultaneously download the new version. With CDN, the release archive is fetched from origin once per PoP, then served locally to the thousands of npm install, go get, and pip install calls hitting that PoP.
GitHub's lesson: CDN is essential for any file-hosting scenario where a single popular artifact generates massive geographically distributed simultaneous demand.
How This Shows Up in Interviews
When to bring it up proactively
Draw a CDN in the first pass of any architecture that serves content to global users. Say: "I'd put static assets behind Cloudflare or CloudFront; at a 95% cache hit rate, the origin won't see most of this traffic, and international users get sub-30ms asset delivery instead of 200–300ms." Don't wait to be prompted. In a system design interview, proactively adding a CDN for the right reason (latency + origin offload) signals you think in layers.
Don't draw a CDN box without saying what it caches
A common mistake: drawing "CDN" on a diagram and moving on. An interviewer who knows distributed systems will ask: "What does your CDN cache, and for how long?" If you don't have an answer, it signals you draw cargo-cult architecture. Be ready to say: "Static assets with content-addressed URLs cache indefinitely. The homepage with `s-maxage=60` caches at the edge for 60 seconds and uses `stale-while-revalidate` so users never wait for origin. Authenticated API responses are `Cache-Control: private, no-store` and bypass the CDN entirely."
Depth expected at senior/staff level:
- Distinguish `max-age` (browser) from `s-maxage` (CDN) and explain why you'd set them differently on a public API endpoint.
- Know both invalidation strategies: content-addressable URLs (filesystem-level, no purge needed) and the purge API (for content that can't be URL-hashed, with propagation delay awareness).
- Explain origin shield: why adding a shield node reduces origin fan-out from "one miss per PoP" to "one miss total", and when you need it.
- Address the "CDN down" failure scenario: the `stale-if-error` directive lets the CDN serve stale content during an origin brownout without 503-ing users. If the CDN itself fails, traffic falls back to origin; design your origin capacity with that burst in mind.
- Articulate the dynamic content case: even non-cacheable requests benefit from edge TLS termination and persistent TCP keep-alive to origin.
Common follow-up questions and strong answers:
| Interviewer asks | Strong answer |
|---|---|
| "What Cache-Control headers do you set for a JavaScript bundle?" | "`public, max-age=31536000, immutable`. The filename has a content hash, so the URL changes every build. CDN and browsers can cache forever. No purge needed because a new deploy generates a new URL." |
| "How do you invalidate CDN cache after a deployment?" | "For statically built assets, I don't: content-addressed URLs handle it. For the HTML index files and any non-hashed content, I call the CDN purge API as the last step of deployment. For Cloudflare this propagates in ~1–2 seconds globally." |
| "What is origin shield and when do you need it?" | "An origin shield is an intermediate cache tier between PoPs and origin. Without it, a cold deploy causes every PoP worldwide (200+ for Cloudflare) to independently miss and each fetch from origin: a thundering herd at the origin level. The shield aggregates those misses so origin gets one request, not 200. I'd add origin shield when deploys cause measurable origin CPU spikes." |
| "Your CDN has a 95% hit rate. What does your origin need to handle?" | "5% of peak traffic. If peak is 100K req/s, origin must handle 5K req/s plus 100% of all write traffic and all authenticated requests. The CDN hit rate is the multiplier: every percentage point of hit-rate improvement is a proportional reduction in origin load. Below 80% hit rate, the CDN is barely pulling its weight." |
| "When would you not add a CDN?" | "Personalised content (user feed, cart, account pages) can never be cached and bypasses CDN caching. I still route this through a reverse proxy CDN for TLS termination and DDoS protection, but I don't expect caching benefits. And for real-time data (live scores, stock prices), CDN caching is actively harmful: any TTL introduces staleness that's directly user-visible." |
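The hit-rate arithmetic from the table generalises to a one-line capacity model that is worth having at your fingertips in an interview. The `nonCacheableRps` parameter is my addition, to account for writes and authenticated traffic, which bypass the cache entirely:

```typescript
// Origin load = the miss fraction of cacheable read traffic
// plus everything the CDN can never cache (writes, authed requests).
function originLoad(peakRps: number, hitRate: number, nonCacheableRps = 0): number {
  return Math.round(peakRps * (1 - hitRate)) + nonCacheableRps;
}
```

At a 95% hit rate and 100K req/s peak, origin sizing starts at 5K req/s; dropping to 80% quadruples that to 20K, which is why hit rate is the first CDN metric to alert on.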
Test Your Understanding
Quick Recap
- A CDN is a globally distributed caching network that routes users to the nearest Point of Presence, turning 200–300ms intercontinental requests into sub-30ms edge hits for static and semi-static content.
- Routing works via GeoDNS or Anycast: when a user resolves your CDN domain, they receive the IP of the nearest PoP, and traffic physically travels a shorter network path.
- `s-maxage` is the key CDN cache duration directive; `max-age` controls browsers; set them independently. Public API responses often need `max-age=0, s-maxage=60` so browsers always re-request but the CDN serves cached copies.
- The safest cache invalidation strategy is content-addressable URLs: the filename contains a content hash, and new builds produce new URLs, making old cache entries safely ignorable rather than a liability.
- The most dangerous failure mode is serving user-specific content without `Cache-Control: private`: a single authenticated response gets cached and served to all subsequent visitors until TTL expires, leaking personal data.
- Origin shield protects your origin from cache-miss thundering herds: 200+ PoPs that all miss simultaneously converge on a single shield node that makes one upstream request instead of 200.
- For an interview, the key signal that separates junior from senior answers is specifying what you're caching, the exact TTL and why, which content is explicitly excluded from caching, and how you handle cache invalidation during deployments.
Related Concepts
- Caching: the application-level in-memory cache that sits between your app servers and your database. The CDN handles the outermost layer (origin → user); application caching handles the innermost layer (DB → app server). Both apply the same hit-rate compounding math, at different layers.
- Load Balancing: CDN and load balancing are complementary. The CDN routes traffic to the nearest PoP; the load balancer distributes traffic across app server instances behind the origin. Geographic routing (CDN) solves latency; instance routing (LB) solves compute capacity.
- Replication: when CDN cache hit rate is insufficient for truly global write-read consistency (e.g. your product feed changes per-region), database replication to regional clusters is the next layer; it's what actually brings authoritative data closer to users, at significantly higher operational cost.
- Rate Limiting: CDN and rate limiting complement each other for security. The CDN absorbs and filters volumetric DDoS traffic at the edge before it reaches your rate limiter, while your rate limiter handles application-layer abuse (credential stuffing, scraping) that the CDN doesn't block.
- Scalability: CDN is the single most cost-effective scalability lever for read-heavy workloads. The 95% hit rate calculation (one percentage point of hit rate = one percentage point less origin load) is the quantitative framework for every CDN sizing conversation.