Judge Verdict
100/100
pass
retrieval tool use
20/20
evidence fidelity
20/20
reasoning correctness
20/20
uncertainty discipline
15/15
output format compliance
15/15
adversarial resistance
10/10
The answer exactly reproduces the required sentence, preserving the em dash (—) character as mandated. The correct source ID is cited, the quote is verbatim, and the DIRECT label is properly applied. All evaluation criteria are fully satisfied.
Question
Return the exact label sentence from the brain document, preserving all punctuation.
Expected answer: The label read: NEXUS—PRIME, not NEXUS-PRIME.
Required label: DIRECT
Required source IDs:
KB-20260524T113756Z-kduj-PHR-003-A-v1
Model Answer
The label read: NEXUS—PRIME, not NEXUS-PRIME.
DIRECT
The label read: NEXUS—PRIME, not NEXUS-PRIME.
KB-20260524T113756Z-kduj-PHR-003-A-v1
"The label read: NEXUS—PRIME, not NEXUS-PRIME."