Runs and Results
Purpose
Use Results to understand what happened during a run and what needs to change next.
When to use it
Open a run or result view after starting an eval or playground run.
Prerequisites
- You have started at least one run.
Steps
- Open the run detail page from Evals, Playground, or the project Results area.
- Review the run status and latest step.
- Check how many results completed successfully or failed.
- Inspect per-row outputs, correctness, and any scoring details.
- Review console output or error details when present.
- Export results when you need a CSV copy for analysis outside Spec27.
Expected result
You can explain what the run did, what passed or failed, and where to iterate next.
What to know
- Run detail pages can show output, status, correctness, and logs together.
- Judge-based runs may include explanations and vote details.
- Adversarial runs can surface adversarial inputs instead of only the seeded primary inputs.