Loading clinical scenarios…
Compare agents across 9 tasks · Llama 3 70B optimal baseline
Reward trends · oracle match rate · safety audit
Global rankings · Llama 3 70B leads (RL aligned)
Server-side · Claude Sonnet / smart fallback · voice enabled
v5 · FastAPI · All endpoints with live try-it