QAP-Router: Tackling Qubit Routing as Dynamic Quadratic Assignment with Reinforcement Learning
Kien X. Nguyen, Ankit Kulshrestha, Ilya Safro, Xiaoyuan Liu

TL;DR
QAP-Router introduces a reinforcement learning approach using a dynamic Quadratic Assignment Problem formulation and a Transformer-based policy network to improve qubit routing efficiency in quantum compilation.
Contribution
It models qubit routing as a dynamic QAP and employs a solution-aware Transformer with lookahead, capturing structure for better routing decisions.
Findings
Reduces CNOT gate count by 15.7% on MQTBench circuits.
Achieves 30.4% reduction on AgentQ datasets.
Improves routing efficiency over existing industry compilers.
Abstract
Qubit routing is a fundamental problem in quantum compilation, known to be NP-hard. Its dynamic nature makes local routing decisions propagate and compound over time, making global efficient solutions challenging. Existing heuristic methods rely on local rules with limited lookahead, while recent learning-based approaches often treat routing as a generic sequential decision problem without fully exploiting its underlying structure. In this paper, we introduce QAP-Router, framing qubit routing based on a dynamic Quadratic Assignment Problem (QAP) formulation. By modeling logical interactions, or quantum gates, as flow matrices and hardware topology as a distance matrix, our approach captures the interaction-distance coupling in a unified objective, which defines the reward in the reinforcement learning environment. To further exploit this structure, the policy network employs a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
