Eval dashboard
The AI is not shipped on vibes. A labelled suite runs every case through the analyst + grounding guardrail and scores it. Grounding violations must be zero.
No eval run yet. Click Run eval suite to score the AI.
Each run makes a few real model calls — it takes ~10–20s.