Last Layer Logits to Logic: Empowering LLMs with Logic-Consistent Structured Knowledge Reasoning

Songze Li; Zhiqiang Liu; Zhaoyan Gong; Xiaoke Guo; Zhengke Gui; Huajun Chen; Wen Zhang

arXiv:2511.07910·cs.CL·November 12, 2025

Last Layer Logits to Logic: Empowering LLMs with Logic-Consistent Structured Knowledge Reasoning

Songze Li, Zhiqiang Liu, Zhaoyan Gong, Xiaoke Guo, Zhengke Gui, Huajun Chen, Wen Zhang

PDF

Open Access 3 Reviews

TL;DR

This paper introduces the Logits-to-Logic framework that improves large language models' ability to produce logic-consistent responses in structured knowledge reasoning tasks by correcting their output logits.

Contribution

The paper presents a novel logits-based method to enhance logic consistency in LLM outputs, addressing the limitations of previous prompt-based approaches.

Findings

01

Significant improvement in logic consistency across KGQA benchmarks

02

State-of-the-art performance achieved with the proposed method

03

Effective correction of logical defects in LLM outputs

Abstract

Large Language Models (LLMs) achieve excellent performance in natural language reasoning tasks through pre-training on vast unstructured text, enabling them to understand the logic in natural language and generate logic-consistent responses. However, the representational differences between unstructured and structured knowledge make LLMs inherently struggle to maintain logic consistency, leading to \textit{Logic Drift} challenges in structured knowledge reasoning tasks such as Knowledge Graph Question Answering (KGQA). Existing methods address this limitation by designing complex workflows embedded in prompts to guide LLM reasoning. Nevertheless, these approaches only provide input-level guidance and fail to fundamentally address the \textit{Logic Drift} in LLM outputs. Additionally, their inflexible reasoning workflows cannot adapt to different tasks and knowledge graphs. To enhance…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 4Confidence 2

Strengths

The idea of aligning the logits of LLM outputs with knowledge graph logic to ensure that LLM outputs follow KG information is interesting. However, the idea is somewhat difficult to understand. LLM outputs are probability distributions over tokens, while in a KG, an entity name or relation may consist of multiple words. It is unclear how this mapping is performed. The experimental results are promising, but in Table 1, only Hit@1 is reported; F1 scores are not provided. It would be better to in

Weaknesses

The paper lacks an introduction to preliminary knowledge, which makes it difficult to understand and to assess the true contributions. Moreover, the contribution seems somewhat incremental. No example is provided in the paper. It would be helpful to include a complete example that illustrates the entire process, from beginning to end, to facilitate understanding.

Reviewer 02Rating 6Confidence 4

Strengths

+ The manuscript addresses the issue of logic drift by directly intervening in the logits. + Extensive experiments show significant improvements. + The framework demonstrates significant computational efficiency

Weaknesses

- Evaluation mainly focus on LLaMa and Qwen model. Other foundation models or larger models are not validated due to resource constraints. It would benefit analyzing how the framework scales with larger models. - Although the class-agnostic loss helps prevent overfitting to specific classes, the overall framework may stil struggle with class imbalance or biased training samples - The framework relies heavily on the predefined hyperparameters for the loss terms. While the paper show an empirical

Reviewer 03Rating 4Confidence 4

Strengths

1. This paper is the first KGQA method that considers output-level control that corrects LLM outputs by manipulating logits. 2. The results look promising in a few benchmarks. It adapts to different KGs and tasks (multi-hop QA, slot filling). 3. The form of displaying results is good, where multiple different kinds of figures are used to show results.

Weaknesses

1. The information in Figure 1 is not clear. "current approaches" is very vague. I cannot get which methods and datasets are tested. 2. The "logic" formulated in this paper mainly depends on paths, not true logics. Along with this problem, the novelty of this paper is a concern where there are many path-based methods like RoG (Luo et. al.). The authors should have a separate subsection in related works to discuss methods lie in this type. 3. Based on the results in Figure 2, the main technique t

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Multimodal Machine Learning Applications