Temporal Reasoning Is Not the Bottleneck: A Probabilistic Inconsistency Framework for Neuro-Symbolic QA
Tran Quang Liem

TL;DR
This paper introduces a neuro-symbolic framework with a Probabilistic Inconsistency Signal that isolates representation errors from reasoning failures, significantly improving temporal reasoning accuracy in QA tasks.
Contribution
It presents a novel decoupled architecture that lifts text into explicit event graphs and uses probabilistic signals to detect structural errors, shifting focus from reasoning to representation quality.
Findings
Achieves 100% accuracy on temporal arithmetic benchmarks with correct representations.
Maintains 75.1% accuracy in noisy QA settings while enabling failure localization.
Reframes temporal QA as a structural alignment problem rather than a reasoning challenge.
Abstract
Despite significant advances, large language models (LLMs) continue to exhibit brittle performance on complex temporal reasoning tasks. This failure mode is widely attributed to inherent deficits in autoregressive logical deduction. In this paper, we challenge this prevailing narrative, demonstrating that temporal reasoning is not the fundamental bottleneck; rather, the locus of failure lies in unstructured text-to-event representation. We introduce a novel neuro-symbolic question-answering framework governed by a Probabilistic Inconsistency Signal (PIS) that explicitly isolates perceptual errors from reasoning failures. By lifting unstructured text into explicit event graphs and interval constraints, our architecture strictly decouples semantic extraction from a symbolic reasoning engine. To robustly detect structural breaks, the PIS elegantly unifies symbolic credal intervals with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
