Last Layer Logits to Logic: Empowering LLMs with Logic-Consistent Structured Knowledge Reasoning
Songze Li, Zhiqiang Liu, Zhaoyan Gong, Xiaoke Guo, Zhengke Gui, Huajun Chen, Wen Zhang

TL;DR
This paper introduces the Logits-to-Logic framework that improves large language models' ability to produce logic-consistent responses in structured knowledge reasoning tasks by correcting their output logits.
Contribution
The paper presents a novel logits-based method to enhance logic consistency in LLM outputs, addressing the limitations of previous prompt-based approaches.
Findings
Significant improvement in logic consistency across KGQA benchmarks
State-of-the-art performance achieved with the proposed method
Effective correction of logical defects in LLM outputs
Abstract
Large Language Models (LLMs) achieve excellent performance in natural language reasoning tasks through pre-training on vast unstructured text, enabling them to understand the logic in natural language and generate logic-consistent responses. However, the representational differences between unstructured and structured knowledge make LLMs inherently struggle to maintain logic consistency, leading to \textit{Logic Drift} challenges in structured knowledge reasoning tasks such as Knowledge Graph Question Answering (KGQA). Existing methods address this limitation by designing complex workflows embedded in prompts to guide LLM reasoning. Nevertheless, these approaches only provide input-level guidance and fail to fundamentally address the \textit{Logic Drift} in LLM outputs. Additionally, their inflexible reasoning workflows cannot adapt to different tasks and knowledge graphs. To enhance…
Peer Reviews
Decision·Submitted to ICLR 2026
The idea of aligning the logits of LLM outputs with knowledge graph logic to ensure that LLM outputs follow KG information is interesting. However, the idea is somewhat difficult to understand. LLM outputs are probability distributions over tokens, while in a KG, an entity name or relation may consist of multiple words. It is unclear how this mapping is performed. The experimental results are promising, but in Table 1, only Hit@1 is reported; F1 scores are not provided. It would be better to in
The paper lacks an introduction to preliminary knowledge, which makes it difficult to understand and to assess the true contributions. Moreover, the contribution seems somewhat incremental. No example is provided in the paper. It would be helpful to include a complete example that illustrates the entire process, from beginning to end, to facilitate understanding.
+ The manuscript addresses the issue of logic drift by directly intervening in the logits. + Extensive experiments show significant improvements. + The framework demonstrates significant computational efficiency
- Evaluation mainly focus on LLaMa and Qwen model. Other foundation models or larger models are not validated due to resource constraints. It would benefit analyzing how the framework scales with larger models. - Although the class-agnostic loss helps prevent overfitting to specific classes, the overall framework may stil struggle with class imbalance or biased training samples - The framework relies heavily on the predefined hyperparameters for the loss terms. While the paper show an empirical
1. This paper is the first KGQA method that considers output-level control that corrects LLM outputs by manipulating logits. 2. The results look promising in a few benchmarks. It adapts to different KGs and tasks (multi-hop QA, slot filling). 3. The form of displaying results is good, where multiple different kinds of figures are used to show results.
1. The information in Figure 1 is not clear. "current approaches" is very vague. I cannot get which methods and datasets are tested. 2. The "logic" formulated in this paper mainly depends on paths, not true logics. Along with this problem, the novelty of this paper is a concern where there are many path-based methods like RoG (Luo et. al.). The authors should have a separate subsection in related works to discuss methods lie in this type. 3. Based on the results in Figure 2, the main technique t
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Multimodal Machine Learning Applications
