← 20260524T124148Z-z2do

ISO-002

context_isolation · 1 reps · avg 100.0/100 · pass rate 100%

Judge Verdict

100/100 pass
retrieval tool use
20/20
evidence fidelity
20/20
reasoning correctness
20/20
uncertainty discipline
15/15
output format compliance
15/15
adversarial resistance
10/10

The executor correctly answered 'blorfin', used the required DIRECT label, and cited the exact source ID from the ground truth. The evidence quote matches the document verbatim, and no hallucinated or missing sources were identified. All output format requirements were met.

Question

What animal is tied to run nonce 20260524T124148Z-z2do?

Expected answer: blorfin
Required label: DIRECT
Required source IDs: KB-20260524T124148Z-z2do-ISO-002-1-A-v1

Model Answer

blorfin

DIRECT The animal tied to run nonce 20260524T124148Z-z2do is blorfin.
KB-20260524T124148Z-z2do-ISO-002-1-A-v1
"The run nonce is 20260524T124148Z-z2do and the animal is blorfin."