The Alignment Bottleneck in Decomposition-Based Claim Verification
Mahmud Elahi Akhter, Federico Ruggeri, Iman Munire Bilal, Rob Procter, and Maria Liakata

TL;DR
This paper investigates the challenges in claim decomposition for verification, highlighting evidence alignment issues and error profiles, and demonstrates that precise evidence alignment improves performance.
Contribution
It introduces a new dataset with real-world claims and evidence, and systematically evaluates how evidence alignment impacts claim verification performance.
Findings
Decomposition improves verification only with granular, aligned evidence
Standard repeated evidence setups often degrade performance
Conservative abstention reduces error propagation
Abstract
Structured claim decomposition is often proposed as a solution for verifying complex, multi-faceted claims, yet empirical results have been inconsistent. We argue that these inconsistencies stem from two overlooked bottlenecks: evidence alignment and sub-claim error profiles. To better understand these factors, we introduce a new dataset of real-world complex claims, featuring temporally bounded evidence and human-annotated sub-claim evidence spans. We evaluate decomposition under two evidence alignment setups: Sub-claim Aligned Evidence (SAE) and Repeated Claim-level Evidence (SRE). Our results reveal that decomposition brings significant performance improvement only when evidence is granular and strictly aligned. By contrast, standard setups that rely on repeated claim-level evidence (SRE) fail to improve and often degrade performance as shown across different datasets and domains…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Bayesian Modeling and Causal Inference
