Excited to share our EMNLP 2025 (Main) paper: "Detecting Corpus-Level Knowledge Inconsistencies in Wikipedia with LLMs." How consistent is English Wikipedia? With the help of LLMs, we estimate that 80M+ facts (~3.3%) are internally inconsistent. Small in percentage, large at corpus scale.
This Corpus-Level Inconsistency Detection task is needle-in-a-haystack hard.
Meet CLAIRE: an agent that surfaces potential contradictions with evidence and explanations.