Judges
Purpose
Use Judges when correctness depends on a rubric or judgment, not only exact matching.
When to use it
Create a judge when you need score-based evaluation with explanation or votes.
Prerequisites
- You have a project.
- You know the scoring criteria you want the judge to apply.
Steps
- Open Judges inside a project.
- Create a judge with a clear title and prompt or configuration.
- Open the judge detail page to review the configuration summary.
- Use the judge test flow with sample input and output.
- Confirm the returned score, explanation, and vote details match expectations.
- Use the judge in a judge-based specification or evaluation flow.
Expected result
You have a reusable judge that can score outputs across runs.
What to know
- Judges can be listed at both the project and organization level.
- Judge tests help validate scoring behavior before you use the judge in a broader workflow.
- Judge-based scoring is best for nuanced correctness checks.