Same alert, two agents — naive vs Backstop, on a real cluster.
The same poisoned alert hits both agents. The naive one acts on a hallucinated diagnosis and takes the production database to zero. Backstop catches the bad output, re-routes to a stronger model, and rolls the real deploy back — on a live Kubernetes cluster, through the TrueFoundry gateway.
destructive actions blocked by guardrails
the fail-safe agent's outcome
what the unguarded agent did
deployments ready, both namespaces
Capabilities engaged this run.
Real Kubernetes, polled every 2s.
Connecting to cluster…