A Mutual Information-based Metric for Temporal Expressivity and Trainability Estimation in Quantum Policy Gradient Pipelines
Jaehun Jeong, Donghwa Ji, Kabgyun Jeong

TL;DR
This paper introduces a mutual information-based metric to evaluate the temporal expressivity and trainability of quantum policy gradient models, addressing a gap in reinforcement learning analysis.
Contribution
It proposes a novel temporal expressivity measure tailored for reinforcement learning and links it to mutual information and gradient stability in quantum models.
Findings
Mutual information bounds the scaled gradient norm.
Decomposition of temporal expressivity reveals key information.
MI-TET prescreens gradient fragility at initialization.
Abstract
In recent years, various limitations of conventional supervised learning have been identified, motivating the development of reinforcement learning--and quantum reinforcement learning that leverages quantum resources such as entanglement and superposition. Among the various reinforcement learning methodologies, the policy gradient method is considered to have many benefits; for instance, it allows an agent to learn without explicitly knowing the crucial information of the environment such as state transition probabilities and initial state distribution. Meanwhile, from the perspective of learning, two indicators are often regarded as significant: expressivity and trainability (for gradient-based methods). While a number of attempts have been made to quantify the expressivity and trainability of Neural Network models and PQCs, clear efforts suitable for reinforcement learning settings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Quantum many-body systems · Quantum Information and Cryptography
