Hybrid Quantum-Classical Recurrent Neural Networks
Wenduan Xu

TL;DR
This paper introduces a hybrid quantum-classical recurrent neural network (QRNN) architecture that leverages quantum memory and classical control to perform sequence learning tasks, demonstrating competitive results in various benchmarks.
Contribution
The paper presents the first quantum-grounded RNN model combining unitary quantum recurrence with classical nonlinear control, enabling efficient sequence learning.
Findings
Achieved competitive performance on sentiment analysis, MNIST, and language modeling.
Implemented a quantum-inspired attention mechanism improving translation tasks.
Demonstrated the model's scalability up to 14 qubits in simulation.
Abstract
We present a hybrid quantum-classical recurrent neural network (QRNN) architecture in which the recurrent core is realized as a parametrized quantum circuit (PQC) controlled by a classical feedforward network. The hidden state is the quantum state of an -qubit PQC in an exponentially large Hilbert space , which serves as a coherent recurrent quantum memory. The PQC is unitary by construction, making the hidden-state evolution norm-preserving without external constraints. At each timestep, mid-circuit Pauli expectation-value readouts are combined with the input embedding and processed by the feedforward network, which provides explicit classical nonlinearity. The outputs parametrize the PQC, which updates the hidden state via unitary dynamics. The QRNN is compact and physically consistent, and it unifies (i) unitary recurrence as a high-capacity memory, (ii) partial…
Peer Reviews
Decision·Submitted to ICLR 2026
- The model is evaluated on six diverse tasks (sentiment analysis, MNIST, pMNIST, copying memory, language modeling, machine translation) and is shown to be competitive with or outperform classical RNNs, LSTMs, and specifically designed orthogonal RNNs (scoRNN). - The paper is generally well-structured and clearly written. I enjoyed the reading flow. - The paper thoughtfully discusses the path to hardware implementation, acknowledging current simulation limits and proposing a realistic ancilla
- What is the advantage of Hybrid Quantum-Classical Recurrent Neural Network? comparing to Classical RNN or other Hybrid quantum NN (Like Hybrid quantum CNN if one can implement)? - It would be nice if the authors could explain something related to GPU consumption or efficency. - It would be nice if the authors could visualize something related to the intermediate hidden states, like the state change in QRNN. - To which perspective does the design of QCNN could benefit the majority of ICLR
1. The proposed quantum model is run across several/realistic sequence tasks instead of just MNIST or toy memory tasks. 2. The authors give hyperparameters, optimizer, qubit counts, measurement sets, and even report variability across 50-100 runs in the appendix.
1. All results are obtained in TorchQuantum on GPUs, and there is no real hardware, no noisy simulator, no demonstration that the mid-circuit readout trick they rely on can actually be executed at the depth/width they need. The paper itself admits that it models mid-circuit measurement “as a limiting case” and that present toolchains are “less optimized” for hybrid recurrence. 2. The paper leans heavily on: “the PQC is unitary ⇒ norm-preserving ⇒ better gradients ⇒ better long-sequence learning
Novel architectural concept: The idea of embedding quantum circuits within RNN recurrence steps is original and conceptually interesting, especially for sequence modeling. Comprehensive benchmarking: The authors evaluate the model across diverse tasks, including language modeling and translation, showing its general applicability. Mathematical consistency: The recurrent evolution is unitary by construction, automatically ensuring norm preservation—an elegant contrast to the ad-hoc regularization
Ambiguity in circuit design: The paper does not clearly distinguish between data-encoding and trainable parts of the PQC. From Figure 1A, the first layer of RY gates seems to correspond to data encoding, but this is not explicitly stated. Without this separation, it is difficult to evaluate what portion of the model’s expressivity truly arises from quantum effects. Modest empirical gains: Across tasks, improvements are small (e.g., +2\% accuracy in classification, BLEU 29.2 --> 31.9 for German
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Quantum many-body systems · Quantum Information and Cryptography
