Quantum-Inspired Self-Attention in a Large Language Model

Nikita Kuznetsov; Niyaz Ismagilov; Ernesto Campos

arXiv:2603.03318·cs.CL·March 5, 2026

Quantum-Inspired Self-Attention in a Large Language Model

Nikita Kuznetsov, Niyaz Ismagilov, Ernesto Campos

PDF

Open Access

TL;DR

This paper introduces a quantum-inspired self-attention mechanism integrated into GPT-1, demonstrating improved language modeling performance with only a modest increase in inference time.

Contribution

The paper presents the first integration of quantum-inspired self-attention into a large language model, enhancing performance over standard self-attention in language tasks.

Findings

01

15.5x better character error rate

02

4.7x better word error rate

03

13x lower cross-entropy loss

Abstract

Recent advances in Natural Language Processing have been predominantly driven by transformer-based architectures, which rely heavily on self-attention mechanisms to model relationships between tokens in a sequence. Similarly, the field of Quantum Natural Language Processing, which seeks to leverage quantum principles to address challenges in language understanding and generation tasks, has seen the recent development of quantum self-attention mechanisms. We propose a classical quantum-inspired self-attention (QISA) mechanism and integrate it into the full autoregressive language modeling pipeline of GPT-1. To the best of our knowledge, this is the first integration of this kind, as previous quantum self-attention mechanisms have been primarily tested on text classification. In our experiments, QISA achieves better performance when compared to standard self-attention on the metrics…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsQuantum Computing Algorithms and Architecture · Quantum many-body systems · Machine Learning in Materials Science