The Limits of Transfer Reinforcement Learning with Latent Low-rank   Structure

Tyler Sam; Yudong Chen; Christina Lee Yu

arXiv:2410.21601·cs.LG·October 30, 2024

The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure

Tyler Sam, Yudong Chen, Christina Lee Yu

PDF

Open Access 1 Video

TL;DR

This paper investigates the limits of transfer reinforcement learning using latent low-rank structures, proposing algorithms that leverage linear representations to reduce complexity and establishing their optimality bounds.

Contribution

It introduces a transfer-ability coefficient and algorithms for latent low-rank MDPs, achieving near-optimal regret bounds that depend on the transfer difficulty.

Findings

01

Algorithms effectively reduce dependence on state and action space sizes.

02

Proved minimax optimality of the algorithms in most settings.

03

Provided lower bounds matching the upper bounds for transfer learning.

Abstract

Many reinforcement learning (RL) algorithms are too costly to use in practice due to the large sizes $S, A$ of the problem's state and action space. To resolve this issue, we study transfer RL with latent low rank structure. We consider the problem of transferring a latent low rank representation when the source and target MDPs have transition kernels with Tucker rank $(S, d, A)$ , $(S, S, d), (d, S, A)$ , or $(d, d, d)$ . In each setting, we introduce the transfer-ability coefficient $α$ that measures the difficulty of representational transfer. Our algorithm learns latent representations in each source MDP and then exploits the linear structure to remove the dependence on $S, A$ , or $S A$ in the target MDP regret bound. We complement our positive results with information theoretic lower bounds that show our algorithms (excluding the ( $d, d, d$ ) setting) are minimax-optimal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure· slideslive

Taxonomy

TopicsMachine Learning and ELM · Domain Adaptation and Few-Shot Learning · Adaptive Dynamic Programming Control

MethodsTuckER