2. Networking — System Design Concepts

🌐 2 · NETWORKING

How a Website Works — Journey of a URL

What happens from the moment you type a URL to seeing the page — end-to-end in 8 steps

Step	What Happens	Protocol / Layer	Latency
① URL Parse	Browser extracts scheme, host, path from `https://example.com/page`	Browser internals	<1ms
② DNS Lookup	Resolve `example.com` → IP address. Checks browser cache → OS cache → resolver → root/TLD/authoritative	DNS (UDP :53)	0-100ms
③ TCP Handshake	3-way handshake: SYN → SYN-ACK → ACK. Establishes reliable connection	TCP (L4)	1 RTT (~30ms)
④ TLS Handshake	Negotiate cipher, exchange keys, verify certificate. TLS 1.3 = 1 RTT, TLS 1.2 = 2 RTT	TLS (L5/L6)	1-2 RTT (~50ms)
⑤ HTTP Request	Send `GET /page HTTP/2` with headers (Host, Accept, Cookie, Auth)	HTTP (L7)	<1ms
⑥ Server Processing	Load balancer → app server → DB query → build response	Application	50-500ms
⑦ HTTP Response	Server sends `200 OK` + HTML body. May be chunked/streamed	HTTP (L7)	Depends on size
⑧ Render	Parse HTML → fetch CSS/JS/images (parallel) → build DOM → paint pixels	Browser engine	100-2000ms

Key insight: Steps ②③④ are the connection overhead — this is why HTTP/2 multiplexing (reuse one connection), connection pooling, and CDNs (shorter distance = fewer RTTs) matter so much for performance.

Optimizations: DNS prefetch (<link rel="dns-prefetch">) · Preconnect (TCP+TLS ahead of time) · HTTP/2 push (send CSS before browser asks) · Edge caching (skip steps ⑥⑦ entirely) · TLS 1.3 0-RTT (resume without handshake)

Interview framing: When asked "what happens when you type a URL", walk through these 8 steps. It shows you understand DNS, TCP, TLS, HTTP, and server architecture — the full networking stack in one answer.

OSI Model

7-layer reference model for network communication — know L4 and L7 for interviews

#	Layer	Protocol	System Design Relevance
7	Application	HTTP, gRPC, DNS, SMTP	L7 Load Balancers (ALB), API Gateways, WAF
6	Presentation	TLS/SSL, JSON, Protobuf	Encryption, serialization
5	Session	WebSocket, RPC	Connection management
4	Transport	TCP, UDP, QUIC	L4 Load Balancers (NLB), port-based routing
3	Network	IP, ICMP	Routing, subnets, VPCs, BGP
2	Data Link	Ethernet, ARP	MAC addresses, switches
1	Physical	Fiber, copper	Data center hardware

TCP / UDP

Transport layer — TCP (reliable, ordered) vs UDP (fast, unreliable)

TCP — 3-Way Handshake

UDP — Fire and Forget

Unicast (TCP)

Broadcast (UDP)

Multicast (UDP)

TCP	UDP
Connection-oriented: 3-way handshake (SYN→SYN-ACK→ACK)	Connectionless: Send immediately
Ordered + guaranteed: Retransmits lost packets	Best-effort: No retransmission, may arrive out-of-order
Flow + congestion control: Sliding window, slow start	No flow control — sender controls pace

Guarantees: TCP guarantees ordered, reliable, duplicate-free delivery via sequence numbers, ACKs, and retransmission. UDP guarantees nothing — but that's why it's fast (no overhead).

TCP: Banking, HTTPS, DB connections, email. Ports: 80/443/3306/5432/6379/9092

UDP: Live video, gaming, DNS, VoIP, QUIC (HTTP/3). Ports: 53/123/443(QUIC)

Real-world: Fortnite/Valorant use UDP for player positions (few lost packets OK, low latency critical). QUIC (HTTP/3) — UDP-based with built-in TLS 1.3, 0-RTT resumption. Used by Chrome, YouTube, Cloudflare.

HTTP / HTTPS

Application layer — HTTP (plaintext) vs HTTPS (TLS encrypted)

HTTP — Plaintext

HTTPS — TLS Encrypted

▸ How HTTPS Works — TLS Handshake

Method	Purpose	Idempotent	Safe	Cacheable
GET	Retrieve	✓	✓	✓
POST	Create	✗	✗	✗
PUT	Replace	✓	✗	✗
PATCH	Partial update	✗	✗	✗
DELETE	Remove	✓	✗	✗

HTTPS Guarantees: Confidentiality (AES-256 encryption) · Integrity (SHA-256 HMAC, tamper-proof) · Authenticity (CA-signed certificate proves server identity) · Forward secrecy (ECDHE — past sessions safe even if key leaked).

2xx — Success
200	OK	201	Created	202	Accepted (async)
204	No Content	206	Partial Content (streaming)
3xx — Redirect
301	Moved Permanently	302	Found (temp redirect)	304	Not Modified (cache)
307	Temp Redirect (keep method)	308	Perm Redirect (keep method)
4xx — Client Error
400	Bad Request	401	Unauthorized	403	Forbidden
404	Not Found	405	Method Not Allowed	408	Request Timeout
409	Conflict	410	Gone (deleted)	413	Payload Too Large
415	Unsupported Media Type	422	Unprocessable Entity	429	Too Many Requests
451	Unavailable For Legal Reasons
5xx — Server Error
500	Internal Server Error	501	Not Implemented	502	Bad Gateway
503	Service Unavailable	504	Gateway Timeout

HTTP/2: Binary framing, multiplexing, header compression (HPACK), server push. HTTP/3: QUIC-based (UDP), no head-of-line blocking, 0-RTT.

HTTP/1.1 vs 2 vs 3: HTTP/1.1 — one request per TCP connection (or pipelining, rarely used). HTTP/2 — multiplexed streams over single TCP, but TCP head-of-line blocking remains (1 lost packet stalls all streams). HTTP/3 (QUIC) — each stream independent over UDP, lost packet only stalls its own stream. 0-RTT resumption — reconnect without handshake (TLS 1.3 session tickets). Adopted by Chrome, YouTube, Cloudflare, Meta.

DNS

Translates domains → IP addresses. The internet's phone book — hierarchical, cached, eventually consistent.

▸ DNS Resolution — Full Lookup Path

Record	Purpose	Example	TTL Guidance
A / AAAA	Domain → IPv4 / IPv6	example.com → 93.184.216.34	300s (failover) to 86400s (stable)
CNAME	Alias to another domain	www → example.com	Can't coexist with other records at apex
ALIAS / ANAME	CNAME at zone apex	example.com → cdn.provider.com	Provider-specific (Route 53, Cloudflare)
MX	Mail routing (priority)	10 mail.example.com	3600s typical
TXT	Verification, SPF, DKIM	v=spf1 include:_spf.google.com	3600s
SRV	Service discovery (port + weight)	_http._tcp.example.com 8080	Used by Consul, K8s
NS	Delegate to nameservers	ns1.example.com	86400s (rarely changes)
CAA	Which CAs can issue certs	0 issue "letsencrypt.org"	Security: restrict cert issuance

TTL strategy: Low TTL (60s) = fast failover, higher query load, good for active-passive DR. High TTL (86400s) = less load, slow propagation, good for stable records. Pre-lower TTL before migrations (drop to 60s 24h before cutover).

Real-world: Route 53 — GeoDNS + health checks for multi-region failover. Cloudflare — 1.1.1.1 resolves in ~11ms globally. Anycast — same IP announced from multiple PoPs (nearest wins). DNSSEC — cryptographic chain of trust preventing DNS spoofing.

Anti-patterns: High TTL before migration — users stuck on old IP for hours. CNAME at apex — breaks MX/NS records. No health checks — DNS routes to dead servers. Relying on DNS for sub-second failover — TTL caching prevents it.

IP & CIDR

Address space, subnet sizing, and private ranges — CIDR suffix = network bits, smaller suffix = bigger block

CIDR	Hosts	Typical Use	Subnet Mask
`/32`	1	Single host (allow-list a server)	255.255.255.255
`/28`	16	Small subnet (NAT GW, bastion)	255.255.255.240
`/24`	256	Standard subnet (one AZ)	255.255.255.0
`/20`	4,096	Large subnet (K8s pod CIDR)	255.255.240.0
`/16`	65,536	VPC per region	255.255.0.0
`/8`	16.7M	Entire org (10.0.0.0/8)	255.0.0.0

Private ranges (RFC 1918): 10.0.0.0/8 · 172.16.0.0/12 · 192.168.0.0/16. Used inside VPCs; not routable on public internet. Plan CIDR carefully — VPC peering requires non-overlapping ranges.

VPC design: Use /16 per VPC, split into /24 subnets per AZ. Separate public (LB), private (app), and isolated (DB) subnets. Leave room for growth — you can't resize a VPC CIDR easily.

Anti-patterns: Overlapping CIDRs — can't peer VPCs. Too small VPC — run out of IPs when scaling. /16 subnets — waste addresses, broadcast domain too large. Using 172.17.0.0/16 — conflicts with Docker default bridge.

Key Ports Cheat Sheet

Standard ports you'll meet in any architecture diagram — ports < 1024 are privileged (need root)

Port	Service	Protocol	Security Notes
`22`	SSH	TCP	Key-based auth only, disable password login
`53`	DNS	UDP/TCP	UDP first, TCP for > 512B or zone transfers
`80`	HTTP	TCP	Redirect to 443, never serve sensitive data
`443`	HTTPS	TCP	TLS 1.3, HSTS header, cert pinning for mobile
`3306`	MySQL	TCP	Private subnet only, never expose publicly
`5432`	PostgreSQL	TCP	SSL mode = require, restrict to app CIDR
`6379`	Redis	TCP	No auth by default — always set requirepass + ACL
`9092`	Kafka	TCP	9093 for TLS, SASL for auth
`9200`	Elasticsearch	TCP	Never expose publicly (data exfil risk)
`27017`	MongoDB	TCP	Enable auth, bind to private IP only
`8080/8443`	App servers	TCP	Non-privileged, behind LB on 80/443
`2379/2380`	etcd	TCP	Client/peer ports, mTLS required

Rule of thumb: Run apps on 8080/8443 (non-privileged) and let a load balancer terminate 80/443. Only expose ports that must be public. Database ports should never be reachable from the internet.

Common breaches: Open Redis (6379) — cryptominer injection. Open Elasticsearch — data exfiltration. Open MongoDB — ransomware. Always scan with nmap or cloud security tools.

Firewalls — Security Groups vs NACLs

Two layers of network filtering in every cloud VPC — defense in depth

▸ VPC Network Security — Layered Filtering

Aspect	Security Group	NACL
Scope	Instance / ENI (can reference other SGs)	Entire subnet
State	Stateful — return traffic auto-allowed	Stateless — must allow both directions explicitly
Rules	Allow only (implicit deny all)	Allow + Deny (explicit deny possible)
Evaluation	All rules evaluated (most permissive wins)	Numbered, first match wins
Use case	Fine-grained: "app SG can talk to DB SG on 5432"	Coarse: "block this CIDR range entirely"
Limits	~60 rules per SG, 5 SGs per ENI	20 rules per NACL (soft limit)

Best practice: Least-privilege allow-list per port. Default deny everything. Reference SG-to-SG instead of CIDR (survives IP changes). Use NACLs as a coarse "block this CIDR" knife for known-bad ranges.

K8s equivalent: NetworkPolicy — pod-level firewall (Calico, Cilium). Default deny all ingress/egress, then allow specific label selectors. Service Mesh (Istio) adds L7 policies (allow GET /api but deny POST).

Anti-patterns: 0.0.0.0/0 on DB port — database exposed to internet. Single SG for everything — no isolation between tiers. Overly permissive egress — allows data exfiltration. No logging — can't detect unauthorized access.

Zero Trust Networking

"Never trust, always verify" — no implicit trust for internal traffic. Every request authenticated, authorized, encrypted.

▸ Zero Trust — Every Hop Verified

Component	Purpose	Tools
Service Identity	Cryptographic identity per workload	SPIFFE/SPIRE, K8s ServiceAccount, AWS IAM Roles
mTLS	Mutual authentication + encryption	Istio, Linkerd, Consul Connect, Cilium
Policy Engine	Fine-grained authorization (who can call what)	OPA/Rego, Istio AuthorizationPolicy, Cedar
Cert Management	Auto-rotate short-lived certificates	cert-manager, Vault PKI, SPIRE
BeyondCorp Proxy	Identity-aware access for humans	Cloudflare Access, Google IAP, Zscaler
Observability	Audit all access decisions	Envoy access logs, OPA decision logs

Implementation: Istio/Linkerd — inject sidecar proxy, auto-mTLS between all pods. SPIFFE — universal workload identity (x509 SVIDs). OPA — "svc-a can call svc-b GET /api/orders but not DELETE". Short-lived certs (1h) — compromised cert expires quickly.

Real-world: Google BeyondCorp — no VPN, all access through identity-aware proxy. Netflix — mTLS everywhere via custom CA. Airbnb — SPIFFE for service identity. Cloudflare — Access replaces VPN for employee access.

Anti-patterns: VPN = trusted — once inside VPN, full access (flat network). IP-based allow-lists — IPs change, can be spoofed. Long-lived certs — compromised cert valid for years. No east-west encryption — internal traffic sniffable.

DDoS Defense

Layers of mitigation from edge to origin — absorb volumetric attacks, filter application-layer floods

▸ DDoS Defense — Multi-Layer Protection

Layer	Attack Type	Defense	Tools
L3/L4 (Network)	SYN flood, UDP amplification, ICMP flood	Anycast absorption, scrubbing centers, BGP blackhole	Cloudflare Magic Transit, AWS Shield Advanced
L7 (Application)	HTTP flood, slowloris, cache-busting	WAF rules, rate-limit per IP/JA3/path, geo-block	Cloudflare WAF, AWS WAF, Akamai Kona
Bot / Credential	Credential stuffing, scraping, account takeover	CAPTCHA, device fingerprint, behavioral ML	Cloudflare Bot Mgmt, PerimeterX, DataDome
DNS	DNS amplification, NXDOMAIN flood	Anycast DNS, rate-limit queries, DNSSEC	Route 53 Shield, Cloudflare DNS
API	API abuse, enumeration, resource exhaustion	API gateway rate-limit, token bucket, signed requests	Kong, Apigee, AWS API Gateway

▸ DDoS Attack Types & Signatures

Volumetric (L3/L4)

SYN flood: exhaust connection table
UDP amplification: DNS/NTP/memcached reflection
ICMP flood: saturate bandwidth
Scale: 1-3 Tbps (record attacks)
Defense: absorb at edge, can't filter at app

Application (L7)

HTTP flood: legitimate-looking requests at scale
Slowloris: hold connections open slowly
Cache-busting: unique URLs bypass CDN
Scale: 10K-10M RPS
Defense: WAF, rate-limit, CAPTCHA, behavioral

Defense-in-depth: Edge (Cloudflare/Shield) absorbs volumetric → WAF filters L7 → Rate limiting per IP/path → Bot management challenges suspicious → App gracefully degrades under remaining load. Each layer reduces attack surface for the next.

Preparation: Always-on protection (not on-demand — too slow to activate). Runbook for escalation. Load test your own infrastructure to know breaking points. Separate static assets on CDN (attackers can't exhaust origin for static content).

Anti-patterns: No edge protection — origin directly exposed. On-demand only — takes 10+ min to activate during attack. Single-region — no geographic distribution to absorb. Exposing origin IP — attackers bypass CDN directly.

Real-world: Cloudflare — mitigated 71M RPS attack (2023). AWS Shield Advanced — auto-mitigates with DDoS Response Team. Google Cloud Armor — adaptive protection with ML. GitHub — survived 1.35 Tbps memcached amplification (2018).

Event Loop & I/O Multiplexing

How one thread handles 100K+ connections — the engine behind Redis, Nginx, and Node.js

Multiplexer	OS	Scalability	Used By
select	All	O(n) scan, max 1024 fds	Legacy, portable
poll	All	O(n) scan, no fd limit	Slightly better select
epoll	Linux	O(1) per ready fd, millions of fds	Redis, Nginx, Node.js
kqueue	BSD/macOS	O(1) per ready fd	Nginx (macOS), FreeBSD
io_uring	Linux 5.1+	O(1) + zero-copy, async submission	Next-gen (Tigerbeetle, Seastar)

The reactor pattern: Register interest in events (read/write ready) → epoll_wait() blocks until events fire → dispatch to handlers → loop. One thread handles thousands of concurrent connections because it never waits on any single one.

Interview application: When explaining why Redis/Nginx are fast with one thread, say: "It uses epoll-based I/O multiplexing — the kernel notifies which sockets have data, so the thread only processes ready connections and never blocks. This gives concurrency without threads."