Judge Verdict
100/100
pass
retrieval tool use
20/20
evidence fidelity
20/20
reasoning correctness
20/20
uncertainty discipline
15/15
output format compliance
15/15
adversarial resistance
10/10
The tested AI answer correctly answers 'No', cites the required source verbatim, uses a DIRECT label, and contains no hallucinations or formatting errors. It fully meets all ground truth requirements and demonstrates perfect recall, fidelity, and reasoning.
Question
May people light up on a plane?
Expected answer: No — the document states this is forbidden/prohibited.
Required label: DIRECT
Required source IDs:
KB-20260524T130808Z-kqze-SDR-003-A-v1
Model Answer
No, people may not light up (smoke) on a plane. Smoking is prohibited during flights.
DIRECT
Smoking is prohibited during flights.
KB-20260524T130808Z-kqze-SDR-003-A-v1
"Smoking is prohibited during flights."