On the Power of Multitask Representation Learning in Linear MDP

Rui Lu; Gao Huang; Simon S. Du

arXiv:2106.08053·cs.LG·June 16, 2021·5 cites

Open Access

TL;DR

This paper provides a theoretical analysis of multitask representation learning in linear MDPs, showing it reduces sample complexity and highlighting the importance of the LAFA criterion and adaptive sampling, supported by empirical results.

Contribution

It introduces the LAFA criterion and demonstrates how multitask representation learning can significantly lower sample complexity in linear MDPs, with theoretical and empirical validation.

Findings

01

LAFA criterion $\kappa$ influences sample efficiency

02

Multitask learning reduces required samples for new tasks

03

Empirical results support theoretical analysis

Abstract

While multitask representation learning has become a popular approach in reinforcement learning (RL), theoretical understanding of why and when it works remains limited. This paper analyzes the statistical benefit of multitask representation learning in linear Markov Decision Processes (MDPs) under a generative model. We consider an agent that learns a representation function $\phi$ from a function class $\Phi$ using $T$ source tasks with $N$ data points per task, and then uses the learned $\hat{\phi}$ to reduce the number of samples required for a new task. We first discover a Least-Activated-Feature-Abundance (LAFA) criterion, denoted $\kappa$, with which we prove that a straightforward least-squares algorithm learns a policy that is $\tilde{O}\left(H^{2} \sqrt{\frac{C(\Phi)^{2} \kappa d}{N T} + \frac{\kappa d}{n}}\right)$ sub-optimal. Here $H$ is the planning horizon,…
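To make the trade-off in the bound concrete, the sketch below evaluates only the two fractions that appear in it, ignoring the $H^{2}$ factor and the constants and logarithmic terms hidden by $\tilde{O}$. It reads $C(\Phi)$ as a complexity measure of the class $\Phi$, $d$ as the representation dimension, and $n$ as the number of samples from the new task; the abstract's own definitions are cut off above, so these readings are assumptions. The function name `bound_terms` and all numeric values are hypothetical and chosen only for illustration.

```python
def bound_terms(C_phi, kappa, d, N, T, n):
    """Return the two fractions from the abstract's bound.

    Constants, log factors, and the H^2 prefactor are deliberately omitted,
    so the numbers are only meaningful relative to each other.
    """
    # Term driven by the T source tasks with N samples each.
    source_term = (C_phi ** 2) * kappa * d / (N * T)
    # Term driven by the n samples collected on the new task.
    target_term = kappa * d / n
    return source_term, target_term


# Hypothetical values, not taken from the paper.
C_phi, kappa, d, n = 100.0, 2.0, 5, 1_000

for N, T in [(100, 10), (10_000, 10), (10_000, 1_000)]:
    src, tgt = bound_terms(C_phi, kappa, d, N, T, n)
    print(f"N*T={N * T:>10,d}  source term={src:.4g}  target term={tgt:.4g}")
```

As the total source data $N T$ grows, the first fraction shrinks while the second stays fixed at $\kappa d / n$, which does not involve $C(\Phi)$. Under these assumptions, that is the sense in which a representation learned from many source tasks lowers the sample requirement on the new task.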

Taxonomy

Topics: Reinforcement Learning in Robotics · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning