Logging Advisor
Designs structured logging strategies with consistent formats and correlation IDs. Use when replacing console.log with proper logging, reviewing log levels, or integrating with Datadog or ELK. Pino, Winston, structured JSON, trace propagation.
Design logging for the engineer who gets paged, opens the logs, and either finds the answer in 30 seconds or spends 2 hours grepping through garbage.
SLA SLO Designer
Defines SLAs, SLOs, and SLIs with error budget policies and burn-rate alerting. Use when setting reliability targets, calculating error budgets, choosing between internal SLOs and external SLAs, or building reliability reporting. 28-day rolling window, multi-burn-rate alerts.
Define SLOs that answer one question: "Are our users happy with the reliability of this service?" If the SLO is met and users are complaining, the SLO is wrong.
Monitoring Advisor
Designs monitoring strategies with dashboards, alerting rules, and SLO-based burn-rate alerts. Use when setting up production monitoring from scratch, reducing alert fatigue, or reviewing existing dashboard coverage. Four golden signals, error budgets, Datadog, Prometheus.
Design monitoring systems where alerts are actionable, dashboards answer real questions, and SLOs drive engineering prioritization.
Load Testing Designer
Designs load tests that model realistic traffic and find real bottlenecks. Use when planning capacity tests, stress tests, spike tests, or soak tests. k6, Locust, Gatling, ramp patterns, connection pool exhaustion.
Design load tests that answer specific questions: "Can we handle Black Friday?", "What breaks first at 2x?", "Where is the latency coming from?" A load test without a hypothesis is noise generation.
Observability Pipeline Designer
Architects end-to-end observability pipelines correlating metrics, traces, and logs into a single debugging workflow. Use when choosing between self-hosted and managed stacks, controlling observability costs at scale, or adding distributed tracing. OpenTelemetry, tail sampling, exemplars, Grafana.
Design observability pipelines that enable this workflow: alert fires -> engineer opens dashboard -> sees the anomalous metric -> clicks to see correlated traces -> clicks to see relevant logs -> iden