Quantifying Logical Consistency in Transformers via Query-Key Alignment
Eduard Tulchinskii, Anastasia Voznyuk, Laida Kushnareva, Andrei, Andriiainen, Irina Piontkovskaya, Evgeny Burnaev, Serguei Barannikov

TL;DR
This paper introduces a lightweight, scalable method to evaluate logical reasoning in transformers by analyzing query-key alignments, providing a reliable way to distinguish valid inferences in large language models.
Contribution
The paper presents a novel QK-score based approach that assesses logical coherence within transformer attention heads, enhancing evaluation robustness and scalability.
Findings
QK-score effectively separates valid and invalid inferences.
Method improves robustness against distractors.
Validated on models from 1.5B to 70B parameters.
Abstract
Large language models (LLMs) have demonstrated impressive performance in various natural language processing tasks, yet their ability to perform multi-step logical reasoning remains an open challenge. Although Chain-of-Thought prompting has improved logical reasoning by enabling models to generate intermediate steps, it lacks mechanisms to assess the coherence of these logical transitions. In this paper, we propose a novel, lightweight evaluation strategy for logical reasoning that uses query-key alignments inside transformer attention heads. By computing a single forward pass and extracting a "QK-score" from carefully chosen heads, our method reveals latent representations that reliably separate valid from invalid inferences, offering a scalable alternative to traditional ablation-based techniques. We also provide an empirical validation on multiple logical reasoning benchmarks,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks
MethodsSoftmax · Attention Is All You Need · Sparse Evolutionary Training
