Quantifying Logical Consistency in Transformers via Query-Key Alignment

Eduard Tulchinskii; Anastasia Voznyuk; Laida Kushnareva; Andrei; Andriiainen; Irina Piontkovskaya; Evgeny Burnaev; Serguei Barannikov

arXiv:2502.17017·cs.CL·March 17, 2025

Quantifying Logical Consistency in Transformers via Query-Key Alignment

Eduard Tulchinskii, Anastasia Voznyuk, Laida Kushnareva, Andrei, Andriiainen, Irina Piontkovskaya, Evgeny Burnaev, Serguei Barannikov

PDF

Open Access

TL;DR

This paper introduces a lightweight, scalable method to evaluate logical reasoning in transformers by analyzing query-key alignments, providing a reliable way to distinguish valid inferences in large language models.

Contribution

The paper presents a novel QK-score based approach that assesses logical coherence within transformer attention heads, enhancing evaluation robustness and scalability.

Findings

01

QK-score effectively separates valid and invalid inferences.

02

Method improves robustness against distractors.

03

Validated on models from 1.5B to 70B parameters.

Abstract

Large language models (LLMs) have demonstrated impressive performance in various natural language processing tasks, yet their ability to perform multi-step logical reasoning remains an open challenge. Although Chain-of-Thought prompting has improved logical reasoning by enabling models to generate intermediate steps, it lacks mechanisms to assess the coherence of these logical transitions. In this paper, we propose a novel, lightweight evaluation strategy for logical reasoning that uses query-key alignments inside transformer attention heads. By computing a single forward pass and extracting a "QK-score" from carefully chosen heads, our method reveals latent representations that reliably separate valid from invalid inferences, offering a scalable alternative to traditional ablation-based techniques. We also provide an empirical validation on multiple logical reasoning benchmarks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks

MethodsSoftmax · Attention Is All You Need · Sparse Evolutionary Training