Skip to main content

DLR Processor — Service Risk Register

Status: populated Owner: Platform Engineering Last updated: 2026-04-18

Risk Matrix

IDRiskLikelihoodImpactSeverityMitigationOwner
RISK-DLR-01High orphan rate causes billing gapsMediumHighHIGHOrphan monitoring + reconciliation job; alert at 0.5%Platform Eng
RISK-DLR-02PG write contention at peak DLR volumeLowMediumMEDIUMConnection pooling (PgBouncer); ON CONFLICT fast path; partition tablePlatform Eng
RISK-DLR-03smpp-connector schema change breaks inbound parsingMediumHighHIGHPact contract tests; tolerant reader pattern; schema versioningsmpp-connector team
RISK-DLR-04NATS stream retention too short → DLR loss on processor downtimeLowHighHIGHSet retention to 24 h; monitor consumer lag; HPA on pending countSRE
RISK-DLR-05PII leakage via orphaned_receipts rawPayloadLowCriticalCRITICALTablespace encryption; restricted SELECT grants; no PII in logsSecurity
RISK-DLR-06Outbox relay failure causes stale billing/webhookLowHighHIGHOutbox pending alert; automatic retry; on-call runbookPlatform Eng
RISK-DLR-07Duplicate DLR flood from operator degrades performanceMediumMediumMEDIUMIdempotency index exits fast; plan Redis bloom filter for high-volume operatorsPlatform Eng
RISK-DLR-08Race condition: DLR arrives before SENT status written to orchMediumMediumMEDIUMCorrelation retry with 3s backoff before orphaning; reconciliation jobPlatform Eng

Risk Review Cadence

Risk register reviewed monthly in Platform Engineering architecture sync. New risks added as incidents or near-misses occur.