Position: Logical Soundness is not a Reliable Criterion for Neurosymbolic Fact-Checking with LLMs
Jason Chan, Robert Gaizauskas, Zhixue Zhao

TL;DR
This paper critiques the use of formal logic as a sole criterion for neurosymbolic fact-checking with LLMs, highlighting its failure to detect misleading claims due to human inference divergence.
Contribution
It presents a cognitive science-informed typology showing where logical soundness fails to align with human reasoning, proposing a complementary validation approach.
Findings
Logical soundness often elicits unsupported human inferences.
Formal logic alone cannot reliably detect misleading claims.
Leveraging LLMs' human-like reasoning can improve fact-checking accuracy.
Abstract
As large language models (LLMs) are increasing integrated into fact-checking pipelines, formal logic is often proposed as a rigorous means by which to mitigate bias, errors and hallucinations in these models' outputs. For example, some neurosymbolic systems verify claims by using LLMs to translate natural language into logical formulae and then checking whether the proposed claims are logically sound, i.e. whether they can be validly derived from premises that are verified to be true. We argue that such approaches structurally fail to detect misleading claims due to systematic divergences between conclusions that are logically sound and inferences that humans typically make and accept. Drawing on studies in cognitive science and pragmatics, we present a typology of cases in which logically sound conclusions systematically elicit human inferences that are unsupported by the underlying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
