Test Cases

100 cases across 24 categories

Levels: critical, high, medium, low

adversarial (5 cases)

alias (3 cases)

causal (2 cases)

conflicting evidence (3 cases)

context isolation (5 cases)

cross source (10 cases)

deduplication (2 cases)

direct vs inferred (10 cases)

entity disambiguation (3 cases)

exact phrase (5 cases)

exact source id (5 cases)

identifier (3 cases)

judge reliability (5 cases)

missing source (5 cases)

needle haystack (3 cases)

negation (3 cases)

numerical (3 cases)

provenance (2 cases)

raw text (5 cases)

semantic drift (3 cases)

storage (5 cases)

supersession (2 cases)

update versioning (5 cases)

verified (3 cases)
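
The per-category counts above can be tallied against the stated totals. The following is a minimal sketch, assuming the counts are copied by hand into a dictionary; the names `CASES_PER_CATEGORY` and `summarize` are hypothetical and not part of any test harness.

```python
# Counts copied from the category list above (hypothetical variable name).
CASES_PER_CATEGORY = {
    "adversarial": 5,
    "alias": 3,
    "causal": 2,
    "conflicting evidence": 3,
    "context isolation": 5,
    "cross source": 10,
    "deduplication": 2,
    "direct vs inferred": 10,
    "entity disambiguation": 3,
    "exact phrase": 5,
    "exact source id": 5,
    "identifier": 3,
    "judge reliability": 5,
    "missing source": 5,
    "needle haystack": 3,
    "negation": 3,
    "numerical": 3,
    "provenance": 2,
    "raw text": 5,
    "semantic drift": 3,
    "storage": 5,
    "supersession": 2,
    "update versioning": 5,
    "verified": 3,
}


def summarize(counts):
    """Return (number of categories, total number of cases)."""
    return len(counts), sum(counts.values())


num_categories, num_cases = summarize(CASES_PER_CATEGORY)
print(f"{num_cases} cases across {num_categories} categories")
```

A check like this is useful when categories are added or resized, since the headline totals are easy to leave stale.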