Governance

Complete operator oversight without requiring any operator involvement in normal resolution. Live supervision surfaces active conversations with per-conversation sentiment, confidence, and tier state via SSE. Prompt A/B testing runs with statistical significance tracking so you know before you promote a new version.

  • Real-time supervision: live feed of all active conversations across every channel
  • Agent run replay: re-execute any conversation against any model version
  • Prompt A/B testing with statistical significance tracking and one-click rollback
  • OpenTelemetry export for Datadog, Grafana, Honeycomb, any OTEL backend

Live Supervision

SSE Live
C-2041VoiceT1😊96%Resolving
C-2040ChatT2😐74%Escalated
C-2039EmailT1😊93%Resolved
C-2038SMST3😞88%DLQ Review

A/B Test — Prompt v12 vs v13

v12 resolution rate76.2%
v13 resolution rate79.8%
Statistical sig.p = 0.031 ✓

Circuit Breakers

Tier 1 (Haiku)CLOSED
Tier 2 (Sonnet)CLOSED
Tier 3 (Opus)CLOSED