Backend Architecture BE · 01 · 10

Request lifecycle: instrument and harden a service

Hands-on project — build a small HTTP service, instrument all seven stops, then deliberately trigger and fix one failure at each stop, proving every fix with before/after numbers.

BE Senior ◷ 240 min

Level

FoundationsJuniorMiddleSenior

Reading the seven stops is not the same as watching each one fail and fixing it under load. Build a small service, instrument the whole lifecycle, then drive each stop into its characteristic failure and harden it — with evidence at every step.

Goal

Turn the unit’s map into a reproducible loop: instrument each stop, reproduce its failure mode under load, apply the targeted fix, and verify with before/after numbers — so that in a real incident you jump straight to the right stop.

Project

0 of 7

Objective

Build a small HTTP service (Go, Node, or JVM) behind a reverse proxy, instrument all seven lifecycle stops, then deliberately trigger one failure at each stop and harden it — proving every fix with measurements under identical load.

Requirements

Acceptance criteria

A before/after table per stop: the failure symptom, the metric that exposed it (overflow count, 431 rate, serialization ms, RSS, p99), and the value after the fix — all measured under identical load, not estimated.
The accept-queue overflow no longer produces silent drops, the protected route is unreachable without auth, oversized headers/bodies are rejected before buffering, and the streaming endpoint holds flat memory under a slow client.
The fan-out endpoint's p99 is bounded by the propagated deadline rather than summing per-hop timeouts, demonstrated with one downstream artificially slowed.
A one-page lifecycle map for the service: for each of the seven stops, the metric you watch, the failure it catches, and the fix you applied — the artifact you would hand an on-call engineer.

Senior stretch

Add an on-call runbook: given a symptom (empty logs under load, 431 for logged-in users, p99 spike tracking payload size, RSS climbing, slow page despite healthy services), the stop to suspect and the first diagnostic.
Add hedged requests at p95 on the fan-out path and measure the p99.9 improvement against the extra request volume; show the load increase stays in the low single digits.
Add an HTTP/2 variant of the streaming endpoint and show how per-stream flow-control windows plus head-of-line blocking change the backpressure behaviour versus HTTP/1.1.
Add a request-smuggling guard: reject requests carrying both Content-Length and Transfer-Encoding, and demonstrate the proxy and app agree on body framing.

Recap

This is the loop you run in every real lifecycle incident: instrument each stop first, reproduce the failure under load, apply the targeted fix (raise the backlog, reorder middleware, paginate and use a schema serializer, honour backpressure with pipeline(), propagate a deadline and hedge the tail), and verify with before/after numbers under identical load. Doing it once on a toy service turns the seven-stop map into the reflex that, given a symptom, sends you straight to the one stop that owns it.

Something unclear?

Ask a question about this lesson. Questions are anonymous and go straight to the author to make the lesson better.