Experimental quantum speed-up in reinforcement learning agents
Valeria Saggio, Beate E. Asenbeck, Arne Hamann, Teodor Str\"omberg,, Peter Schiansky, Vedran Dunjko, Nicolai Friis, Nicholas C. Harris, Michael, Hochberg, Dirk Englund, Sabine W\"olk, Hans J. Briegel, Philip Walther

TL;DR
This paper demonstrates a quantum advantage in reinforcement learning by using a hybrid quantum-classical communication protocol on a nanophotonic processor, significantly reducing learning time compared to classical methods.
Contribution
It introduces a novel hybrid quantum-classical reinforcement learning protocol and experimentally demonstrates a quantum speed-up using a tunable integrated nanophotonic device.
Findings
Quantum communication enhances learning speed in RL agents.
Hybrid quantum-classical approach allows optimal control of learning.
Experimental setup shows practical integration potential in quantum networks.
Abstract
Increasing demand for algorithms that can learn quickly and efficiently has led to a surge of development within the field of artificial intelligence (AI). An important paradigm within AI is reinforcement learning (RL), where agents interact with environments by exchanging signals via a communication channel. Agents can learn by updating their behaviour based on obtained feedback. The crucial question for practical applications is how fast agents can learn to respond correctly. An essential figure of merit is therefore the learning time. While various works have made use of quantum mechanics to speed up the agent's decision-making process, a reduction in learning time has not been demonstrated yet. Here we present a RL experiment where the learning of an agent is boosted by utilizing a quantum communication channel with the environment. We further show that the combination with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
