Quantum machine learning with glow for episodic tasks and decision games
Jens Clausen, Hans J. Briegel

TL;DR
This paper introduces a quantum reinforcement learning model that combines projective simulation with quantum systems, enabling learning in episodic tasks and decision games through a glow mechanism for policy updates.
Contribution
It presents a novel quantum RL agent model integrating projective simulation with glow-based backpropagation, enhancing generalization in quantum environments.
Findings
Successfully applied to invasion game and grid world tasks
Demonstrates quantum advantage in learning efficiency
Provides a framework for quantum RL with memory and policy updates
Abstract
We consider a general class of models, where a reinforcement learning (RL) agent learns from cyclic interactions with an external environment via classical signals. Perceptual inputs are encoded as quantum states, which are subsequently transformed by a quantum channel representing the agent's memory, while the outcomes of measurements performed at the channel's output determine the agent's actions. The learning takes place via stepwise modifications of the channel properties. They are described by an update rule that is inspired by the projective simulation (PS) model and equipped with a glow mechanism that allows for a backpropagation of policy changes, analogous to the eligibility traces in RL and edge glow in PS. In this way, the model combines features of PS with the ability for generalization, offered by its physical embodiment as a quantum system. We apply the agent to various…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
