Consistency-Guided Decoding with Proof-Driven Disambiguation for Three-Way Logical Question Answering

Tianyi Huang; Ming Hou; Jiaheng Su; Yutong Zhang; and Ziling Zhang

arXiv:2604.06196·cs.CL·April 9, 2026


TL;DR

This paper introduces CGD-PD, a lightweight test-time decoding method for three-way logical question answering. It enforces negation consistency between a hypothesis and its negation and resolves uncertain Unknown predictions through proof-driven disambiguation, improving accuracy by up to 16% on FOLIO.

Contribution

The paper proposes a test-time layer, CGD-PD, that improves logical QA by enforcing negation consistency and applying targeted disambiguation to Unknown outcomes, at an average cost of 4-5 model calls.

Findings

01

Up to 16% accuracy improvement on the FOLIO benchmark.

02

Reduces the number of 'Unknown' predictions.

03

Achieves consistency in negation handling across LLMs.

Abstract

Three-way logical question answering (QA) assigns $\mathit{True}/\mathit{False}/\mathit{Unknown}$ to a hypothesis $H$ given a premise set $S$. While modern large language models (LLMs) can be accurate on isolated examples, we identify two recurring failure modes in 3-way logic QA: (i) negation inconsistency, where answers to $H$ and $\neg H$ violate the deterministic label mapping, and (ii) epistemic $\mathit{Unknown}$, where the model predicts $\mathit{Unknown}$ due to uncertainty or instability even when $S$ entails one side. We present CGD-PD, a lightweight test-time layer that (a) queries a single 3-way classifier on both $H$ and a mechanically negated form of $H$, (b) projects the pair onto a negation-consistent decision when possible, and (c) invokes a proof-driven disambiguation step that uses targeted binary entailment probes to selectively resolve $\mathit{Unknown}$ outcomes, requiring only an average of 4-5 model calls. On the…
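The three steps (a) to (c) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the helper names (`negate`, `classify`, `entails`, `cgd_pd_sketch`), the negation template, and the exact fallback rule are assumptions made for illustration. Only the label mapping under negation (True and False swap, Unknown stays Unknown) follows directly from the logic the abstract describes.

```python
from typing import Callable

TRUE, FALSE, UNKNOWN = "True", "False", "Unknown"

# Deterministic label mapping: if H is True then not-H is False, and so on.
NEGATION_MAP = {TRUE: FALSE, FALSE: TRUE, UNKNOWN: UNKNOWN}

# Hypothetical callable types: both take (premises, hypothesis).
Classifier = Callable[[str, str], str]      # returns True/False/Unknown
EntailmentProbe = Callable[[str, str], bool]  # returns yes/no


def negate(hypothesis: str) -> str:
    """Mechanical negation via a fixed template (an assumed template)."""
    return f"It is not the case that {hypothesis}"


def is_negation_consistent(label_h: str, label_not_h: str) -> bool:
    """Check that the pair of labels respects the negation mapping."""
    return NEGATION_MAP[label_h] == label_not_h


def cgd_pd_sketch(premises: str, hypothesis: str,
                  classify: Classifier, entails: EntailmentProbe) -> str:
    negated = negate(hypothesis)

    # (a) Query the same 3-way classifier on H and on its negation.
    label_h = classify(premises, hypothesis)
    label_not_h = classify(premises, negated)

    # (b) Accept a consistent, definite pair as the decision.
    if label_h != UNKNOWN and is_negation_consistent(label_h, label_not_h):
        return label_h

    # (c) Assumed fallback: binary entailment probes on each side.
    supports_h = entails(premises, hypothesis)
    supports_not_h = entails(premises, negated)
    if supports_h and not supports_not_h:
        return TRUE
    if supports_not_h and not supports_h:
        return FALSE
    return UNKNOWN  # neither or both sides supported: stay with Unknown
```

In this sketch, `classify` and `entails` would wrap LLM calls. A consistent definite pair costs two calls, and the probe path adds two more, which is in the same range as the reported average but is not meant to reproduce it.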
