FAQ
What is a specification in plain language?
A specification is the reusable evaluation recipe. It tells Spec27 what dataset to use and what attack methods or judge setup belong with it.
When should I use Playground instead of Evals?
Use Playground for fast experimentation. Use Evals for named, repeatable workflows.
Do I need a judge for every evaluation?
No. Use strict equality or permitted values when they are sufficient. Add a judge only when evaluation needs interpretation.
Why is my agent preview blocked?
The most common reason is that the agent requires one or more project Secrets that do not exist yet.
Can the same specification be reused?
Yes. A specification can be used by multiple evals.
Where should I look after a run finishes?
Open the run detail page first, then review the project’s Results views if you want a broader summary.