Systems
Why Engineers Are Obsessed With P99
If you only watch the average, you are watching the wrong number. P99 is where the money leaks, where the outages start, and where your users quietly decide to leave. testing
Apr 20, 2026·12 min read
If you only watch the average, you are watching the wrong number. P99 is where the money leaks, where the outages start, and where your users quietly decide to leave. testing
The most dangerous distributed systems failures are the ones where everything looks fine, until it doesn't. Here's the failure mode that buries on-call engineers. testing
I thought I understood API retries. Then I watched Arpit Bhayani explain the thundering herd problem, and realized every retry I'd ever written was either part of the fix - or part of the fire. testing
Idempotency is the single most underrated contract in distributed systems - and ignoring it is how you end up charging customers twice at 3am. testing