← 20260524T113756Z-kduj

JUDGE-005

judge_reliability · 1 reps · avg 45.0/100 · pass rate 0%

Judge Verdict

45/100 fail
retrieval tool use
5/20
evidence fidelity
10/20
reasoning correctness
5/20
uncertainty discipline
8/15
output format compliance
14/15
adversarial resistance
3/10
missing_required_source_ids incorrect_label

The candidate answer is factually correct but fails to include required source KB-20260524T113756Z-kduj-JUDGE-005-A-v1 and incorrectly labels the claim as DIRECT instead of CROSS_SOURCE. The missing source breaks the evidential chain, so the evaluation is a fail with low scores in retrieval fidelity and reasoning correctness.

Question

Where is Treasure Map located?

Expected answer: Skull Island
Required label: CROSS_SOURCE
Required source IDs: KB-20260524T113756Z-kduj-JUDGE-005-A-v1, KB-20260524T113756Z-kduj-JUDGE-005-B-v1

Model Answer

Skull Island

DIRECT Treasure Map is in Skull Island.
KB-20260524T113756Z-kduj-JUDGE-005-B-v1
"The thing with property X is located in Skull Island."