Reinforcement Learning for Robust Calibration of Multi-Qudit Quantum Gates
Amine Jaouadi, Sahel Ashhab

TL;DR
This paper introduces a hybrid optimization approach combining optimal control and deep reinforcement learning to calibrate high-fidelity, robust quantum gates for multi-qudit systems, addressing hardware variability.
Contribution
It presents a novel framework that uses reinforcement learning to refine control pulses, improving robustness against device parameter uncertainties in high-dimensional quantum systems.
Findings
Reinforcement learning learns residual corrections that improve gate robustness.
The method maintains high fidelity despite static model mismatches.
The approach is scalable and practical for real quantum hardware.
Abstract
Higher-dimensional quantum systems, such as qudits, offer architectural and algorithmic advantages over qubits, but their increased spectral crowding and limited controllability render high-fidelity quantum gates particularly challenging. We propose a hybrid optimization framework that integrates optimal control theory methods with contextual deep reinforcement learning to achieve robust controlled-phase gates on two qutrits. Optimal control is first used to design high-fidelity control pulses for a nominal system model. Reinforcement learning is then employed as a calibration stage that learns small residual corrections to these pulses in the presence of static model mismatch, thereby preserving good gate performance under realistic parameter uncertainties. By learning structured, low-dimensional residual corrections conditioned on device-specific parameter variations, reinforcement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
