Skip to main content

Analytics Service — Service Readiness

Status: populated Owner: Platform Engineering + SRE Last updated: 2026-04-18

Service is production-ready only when EVERY box below is checked.

Docs

  • All 17 service docs complete (no stubs remain).

Code + Tests

  • TypeScript strict, zero errors.
  • ESLint passes.
  • Unit coverage: use cases ≥ 90%, VOs 100%.
  • Integration tests pass: billing-event-upsert, dlr-event-upsert, idempotency, rollup, summary-api, account-scope, backfill.
  • Contract tests green: sms.billed.v1 and sms.dlr.inbound.v1 schema registry.
  • Data accuracy test in staging passes (sum of 10,000 injected events matches summary total).

Security

  • security-reviewer agent run, zero critical/high.
  • Account scope enforcement tested (/accounts/:id/usage with wrong account → 403).
  • No MSISDN or message body in any log line (Pino transport verified).
  • mTLS + NetworkPolicy enforced in staging.

Observability

  • All metric families at /metrics.
  • All 6 alerts configured with runbooks.
  • Consumer lag dashboard visible in Grafana.
  • Data freshness dashboard visible.

Infra / Rollout

  • Helm chart + Terraform module committed.
  • HPA based on anlyt_nats_consumer_pending tested in staging.
  • CronJobs for rollup, purge, ETL deployed.
  • PG read replica confirmed in use for REST queries.
  • On-call rotation assigned.

Data

  • PG migrations applied in staging.
  • Partition maintenance for metrics_hourly and operator_performance scheduled.
  • Backfill from NATS stream replay completed in staging and verified.
  • Admin-dashboard cutover to analytics API tested in staging.

Sign-off

  • Tech lead ✅
  • SRE ✅
  • Security ✅
  • Product ✅