Demo · storyboard click a scene to open · expand for the script

Storyboard

Diagnose → reproduce → eval → fix → guard

▶ Watch (1080p) 4K master ↓ ▶ Present
ACT 1

The problem

~45s · the SRE agent regresses
ACT 2

Fidian, behind the scenes

~80s · autonomous triage + eval anatomy
ACT 3

The developer

~35s · Claude Code + eval session
ACT 4

The fix (two phases)

~135s · 0% → 20% → 90%
ACT 5

Scale & the closed loop

~50s

Branches

the four use cases — Debug is rendered; the others are script-only today

🛠️ Debug

live

Fix a reported regression — a thumbs-down in the channel becomes a root cause, a fix, and a permanent eval. 0% → 20% → 90% on Multi-Hop Causal Analysis · PR #247.

▶ Present branch

🧭 Expand

script-only

Build coverage on an agent with no evals — point Fidian at the source + live agent, 12 candidates in 3 buckets, decide the ambiguities with Claude, promote as PR #129.

▶ Present branch

⚡ Optimize

script-only

Cut cost and latency, hold quality — sweep 5 candidate models, evolve the cheapest (prompt + code + caching) until it holds the bar, PR #131 with cost deltas per scenario.

▶ Present branch

🛡️ Guard

script-only

Enforce safety and scope — agent claims to have run remediation in prod → encode a plain-language policy → guardrail v0→v3 30/30 → deploy in monitor or block mode.

▶ Present branch

Presenter shows only the scene (no storyboard chrome). In it: /click next · back · 1–4 pick a branch on the architecture hub · Esc back here.
Tip: open each page fullscreen at 1280×800 @ 100% zoom. Hash routes let you jump to any beat & reset takes.