The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure
Tyler Sam, Yudong Chen, Christina Lee Yu

TL;DR
This paper investigates the limits of transfer reinforcement learning using latent low-rank structures, proposing algorithms that leverage linear representations to reduce complexity and establishing their optimality bounds.
Contribution
It introduces a transfer-ability coefficient and algorithms for latent low-rank MDPs, achieving near-optimal regret bounds that depend on the transfer difficulty.
Findings
Algorithms effectively reduce dependence on state and action space sizes.
Proved minimax optimality of the algorithms in most settings.
Provided lower bounds matching the upper bounds for transfer learning.
Abstract
Many reinforcement learning (RL) algorithms are too costly to use in practice due to the large sizes of the problem's state and action space. To resolve this issue, we study transfer RL with latent low rank structure. We consider the problem of transferring a latent low rank representation when the source and target MDPs have transition kernels with Tucker rank , , or . In each setting, we introduce the transfer-ability coefficient that measures the difficulty of representational transfer. Our algorithm learns latent representations in each source MDP and then exploits the linear structure to remove the dependence on , or in the target MDP regret bound. We complement our positive results with information theoretic lower bounds that show our algorithms (excluding the () setting) are minimax-optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and ELM · Domain Adaptation and Few-Shot Learning · Adaptive Dynamic Programming Control
MethodsTuckER
