Two-way Evidence self-Alignment based Dual-Gated Reasoning Enhancement
Kexin Zhang, Junlan Chen, Daifeng Li, Yuxuan Zhang, Yangyang Feng, Bowen Deng, and Weixu Chen

TL;DR
This paper introduces ESA-DGR, a unified framework with two modules that enhance large language models' reasoning by aligning evidence and fusing knowledge, significantly improving accuracy on knowledge-intensive tasks.
Contribution
It proposes a novel two-way evidence self-alignment and dual-gated reasoning modules, jointly trained to improve evidence utilization and reasoning accuracy in LLMs.
Findings
Achieves 4% higher exact match scores on KIMSR datasets.
Outperforms state-of-the-art fine-tuning methods.
Demonstrates robustness in reasoning with uncertain evidence.
Abstract
Large language models (LLMs) encounter difficulties in knowledge-intensive multi-step reasoning (KIMSR) tasks. One challenge is how to effectively extract and represent rationale evidence. The current methods often extract semantically relevant but logically irrelevant evidence, resulting in flawed reasoning and inaccurate responses. We propose a two-way evidence self-alignment (TW-ESA) module, which utilizes the mutual alignment between strict reasoning and LLM reasoning to enhance its understanding of the causal logic of evidence, thereby addressing the first challenge. Another challenge is how to utilize the rationale evidence and LLM's intrinsic knowledge for accurate reasoning when the evidence contains uncertainty. We propose a dual-gated reasoning enhancement (DGR) module to gradually fuse useful knowledge of LLM within strict reasoning, which can enable the model to perform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications
