Consistency-Guided Decoding with Proof-Driven Disambiguation for Three-Way Logical Question Answering
Tianyi Huang, Ming Hou, Jiaheng Su, Yutong Zhang, and Ziling Zhang

TL;DR
This paper introduces CGD-PD, a lightweight decoding method that improves three-way logical question answering by ensuring consistency and resolving uncertainties through proof-driven disambiguation, significantly boosting accuracy.
Contribution
The paper proposes a novel test-time layer, CGD-PD, that enhances logical QA by enforcing negation consistency and targeted disambiguation, with minimal additional model calls.
Findings
Up to 16% accuracy improvement on FOLIO benchmark.
Reduces the number of 'Unknown' predictions.
Achieves consistency in negation handling across LLMs.
Abstract
Three-way logical question answering (QA) assigns to a hypothesis given a premise set . While modern large language models (LLMs) can be accurate on isolated examples, we identify two recurring failure modes in 3-way logic QA: (i) negation inconsistency, where answers to and violate the deterministic label mapping, and (ii) epistemic , where the model predicts due to uncertainty or instability even when entails one side. We present CGD-PD, a lightweight test-time layer that (a) queries a single 3-way classifier on both and a mechanically negated form of , (b) projects the pair onto a negation-consistent decision when possible, and (c) invokes a proof-driven disambiguation step that uses targeted binary entailment probes to selectively resolve outcomes, requiring only an average of 4-5 model calls. On the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
