Practical Hybrid Quantum Language Models with Observable Readout on Real Hardware

Stefan Balauca; Ada-Astrid Balauca; Adrian Iftene

arXiv:2512.12710·quant-ph·December 16, 2025

Practical Hybrid Quantum Language Models with Observable Readout on Real Hardware

Stefan Balauca, Ada-Astrid Balauca, Adrian Iftene

PDF

Open Access 3 Reviews

TL;DR

This paper demonstrates the first end-to-end training of hybrid quantum language models, specifically QRNNs and QCNNs, on real quantum hardware, showing their potential for sequential data processing despite hardware noise.

Contribution

It introduces a practical architecture for hybrid quantum language models with observable readout and provides empirical evidence of their capability on real quantum devices.

Findings

01

Successful training of quantum language models on IBM Quantum hardware.

02

Trade-offs identified between circuit depth and trainability due to noise.

03

Observable readout enables learning of sequential patterns on NISQ devices.

Abstract

Hybrid quantum-classical models represent a crucial step toward leveraging near-term quantum devices for sequential data processing. We present Quantum Recurrent Neural Networks (QRNNs) and Quantum Convolutional Neural Networks (QCNNs) as hybrid quantum language models, reporting the first empirical demonstration of generative language modeling trained and evaluated end-to-end on real quantum hardware. Our architecture combines hardware-optimized parametric quantum circuits with a lightweight classical projection layer, utilizing a multi-sample SPSA strategy to efficiently train quantum parameters despite hardware noise. To characterize the capabilities of these models, we introduce a synthetic dataset designed to isolate syntactic dependencies in a controlled, low-resource environment. Experiments on IBM Quantum processors reveal the critical trade-offs between circuit depth and…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 4Confidence 3

Strengths

- Introduction of QRNN and QCNN as hardware-feasible quantum analogues of classical sequence models. - Use of multi-sample SPSA for efficient and hardware-compatible gradient estimation, combined with standard backpropagation on classical layers. - Synthetic dataset (TS-LM) for next-word prediction, code and circuits made available for replication.

Weaknesses

- The benchmarked dataset migth be too toy for NLP - It is unclear whether the models are scalable - It might help to discuss the relation/connections with existing work like https://arxiv.org/pdf/2302.13812

Reviewer 02Rating 2Confidence 3

Strengths

* The paper represents a solid engineering effort to implement quantum sequence models on real hardware. The circuits are designed with attention to connectivity and noise limitations, and the authors report hardware-specific details such as gate counts, layouts, and shot configurations. * The manuscript is clearly written, with good structure and detailed appendices. Figures and tables are informative, and experimental settings are transparent. * The work includes ablations on number of shots,

Weaknesses

* The paper does not convincingly explain why quantum circuits are needed for language modeling. It suggests that quantum computation might provide richer representations but offers no theoretical or empirical justification. There is no discussion of what properties of language data could benefit from quantum operations or what specific limitation of classical sequence models this work intends to overcome. * The findings primarily confirm that current hardware can execute small parameterized cir

Reviewer 03Rating 0Confidence 4

Strengths

This work includes results from real quantum hardware.

Weaknesses

1. All elements (QRNN, QCNN, QNLP, SPSA, etc.) are well known. The results show no surprisingly good performance and only match classical baselines with similar parameter counts (a few hundred). This limits the work's novelty and contribution. 2. The task is limited to binary classification and text generation with a synthetic small dataset, raising questions about real-world applicability. 3. The paper lacks theoretical guarantees and discussion of the approach's foundations.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsQuantum Computing Algorithms and Architecture · Quantum many-body systems · Machine Learning in Materials Science