Skip to main content

SERVICE_RISK_REGISTER — bff-backoffice-service

Sibling: SERVICE_READINESS · FAILURE_MODES · SECURITY_MODEL

Living register. Quarterly review by Frontend Platform tech lead + Desktop platform tech lead + SRE + security.

Severity: L low, M medium, H high, C critical. Status: open, monitored, accepted, closed.

1. Strategic risks

IDRiskL×IMitigationOwnerStatusReview
R-S-1Desktop fleet drift across versions causes feature-flag matrix explosionH×MApp-version floor; auto-update channel; release coordination calendarFE Platform + Desktopmonitoredquarterly
R-S-2Hostile network (corporate proxy strips SSE) creates uneven UXM×MPolling fallback; per-device transport preference; tested in chaos drillFE Platformmonitoredquarterly
R-S-3Tenant onboarding requires per-tenant Cloud Armor tuningM×MOperator-cohort baseline traffic profile; tenant-specific overridesSREopenquarterly
R-S-4Sharia-compliance regression in AI suggestions surfaceL×HcomplianceProfile propagation tested; AI surfaces filtered at orchestrator; refusal contract enforced; quarterly reviewCompliance + FEmonitoredquarterly
R-S-5Phase 2 mobile backoffice introduces split-brain riskM×MMobile, if introduced, gets its own BFF or constrained subset; design review requiredArchitectureopenannual

2. Performance & reliability risks

IDRiskL×IMitigationOwnerStatusReview
R-P-1Dashboard composer p95 unbounded if a single upstream regressesH×MPer-widget deadlines + skeletons; per-upstream SLO trackedFE Platformmonitoredquarterly
R-P-2Memorystore session-tier eviction loses sessions during scale eventL×Hnoeviction policy; alarm; auto-scale to 6 GiB during peakSREmonitoredquarterly
R-P-3SSE active-connection storm on coordinated reconnect (e.g., post-incident)M×HPer-instance conn cap; pre-stop drain; gradual reconnect jitter on desktop sideFE Platformmonitoredquarterly
R-P-4Sync handshake bottleneck during fleet-wide reconnectM×Hsync-service rate-limit + jitter; BFF cursor cache absorbs reads; chaos drillSRE + Syncmonitoredquarterly
R-P-5Cloud SQL HA failover > 60 s during peakL×MDR drill verifies; idem keys absorb; trended monthlySREacceptedannual

3. Security risks

IDRiskL×IMitigationOwnerStatusReview
R-Sec-1Device key extraction from OS keychain (rooted device, malware)L×COS keychain isolation; per-device revocation via iam-service; audit alarm on revocation; MFA on sensitive actionsSecurity + Desktopmonitoredquarterly
R-Sec-2DPoP replay bypass in BFFL×CReplay cache (Memorystore single-use jti); DPoP fuzz suite; pen testSecurity + FEmonitoredquarterly
R-Sec-3MFA bypass on lock revocationL×CMFA attestation single-use; consume-then-call ordering; audit; alertsSecurity + FEmonitoredquarterly
R-Sec-4Insider lock-revoke abuseM×HMFA gate; full audit; activity-ledger anomaly detector; quarterly reviewSecurity + Compliancemonitoredquarterly
R-Sec-5Cross-tenant cache leakL×CTenant-scoped keys; nightly synthetic probe; tenant-isolation suiteFE Platformmonitoredquarterly
R-Sec-6Insider folio adjustment fraudM×MMFA gate above threshold; audit; nightly reconciliationCompliance + FEmonitoredquarterly
R-Sec-7Force-logout latency too high (revoked operator continues acting)L×HRefresh-time backstop; SSE channel monitored; e2e latency testedSecurity + FEmonitoredquarterly
R-Sec-8Audit log gap (event published but ledger row missing)L×HReconciliation job (lock_audit_completeness alert); halt lock proxy on gapSecurity + SREmonitoredquarterly

4. Compliance & data risks

IDRiskL×IMitigationOwnerStatusReview
R-C-1DSR for operator returns inconsistent resultsM×Mtenant-service orchestrates DSR; this BFF documented as ephemeral mirrorData Stewardmonitoredquarterly
R-C-2EU operator data residency (Memorystore in asia-south1)M×MPhase 2 region affinity; operator data short-lived; legal review confirmedLegalacceptedannual
R-C-3Audit log export to BigQuery delayedL×MDaily export; nightly reconciliation; SRE alert on lagSREmonitoredquarterly
R-C-4Notes carry inadvertent PIIM×MTruncated to 200 chars in cold mirror; periodic synthetic PII probeData Steward + FEmonitoredquarterly

5. Operational risks

IDRiskL×IMitigationOwnerStatusReview
R-O-1Bus-factor 1 on lock-action audit code pathM×HPair-on-call; runbook completeness reviewEng Managermonitoredannual
R-O-2Schema drift from upstream service released without contract testL×HPact verification gate; OpenAPI diff gate; SCHEMA_DRIFT alertPlatform Engmonitoredquarterly
R-O-3Force-logout drill rare; latency regression undetectedM×MQuarterly chaos drill; SLO tracked monthlySREmonitoredquarterly
R-O-4Auto-update server outage prevents desktop fleet from upgradingM×MIndependent monitoring; secondary mirror; runbookDesktop + SREmonitoredquarterly
R-O-5App-version-floor change breaks legitimate clientsL×HTwo-step release; advisory before enforcement; rollback runbook; coordination calendarFE + Desktopmonitoredquarterly

6. Cost risks

IDRiskL×IMitigationOwnerStatusReview
R-Cost-1SSE long-running connections drive instance countM×MPer-device 1-conn cap; idle timeout; cost dashboardSREmonitoredquarterly
R-Cost-2Pub/Sub volume from telemetry exceeds budgetM×MSample rates per event; cost alarmSREmonitoredquarterly
R-Cost-3Activity ledger storage growthL×M90 d retention + BigQuery export; daily archivalSREmonitoredquarterly

7. Risk acceptance log

IDDate acceptedAccepted byReasonRe-evaluation
R-P-52026-04-15SREHA failover within SLA on every drill in last 12 months2027-04-15
R-C-22026-04-15LegalOperator data short-lived; Phase 2 will add region affinity2027-04-15

8. Review cadence

  • Quarterly: Frontend Platform + Desktop platform + SRE + security.
  • Per major release: any touched row re-rated.
  • Per incident: post-mortem owners audit register.