On the Power of Multitask Representation Learning in Linear MDP
Rui Lu, Gao Huang, Simon S. Du

TL;DR
This paper provides a theoretical analysis of multitask representation learning in linear MDPs, showing it reduces sample complexity and highlighting the importance of the LAFA criterion and adaptive sampling, supported by empirical results.
Contribution
It introduces the LAFA criterion and demonstrates how multitask representation learning can significantly lower sample complexity in linear MDPs, with theoretical and empirical validation.
Findings
LAFA criterion $oldsymbol{}$ influences sample efficiency
Multitask learning reduces required samples for new tasks
Empirical results support theoretical analysis
Abstract
While multitask representation learning has become a popular approach in reinforcement learning (RL), theoretical understanding of why and when it works remains limited. This paper presents analyses for the statistical benefit of multitask representation learning in linear Markov Decision Process (MDP) under a generative model. In this paper, we consider an agent to learn a representation function out of a function class from source tasks with data per task, and then use the learned to reduce the required number of sample for a new task. We first discover a \emph{Least-Activated-Feature-Abundance} (LAFA) criterion, denoted as , with which we prove that a straightforward least-square algorithm learns a policy which is sub-optimal. Here is the planning horizon,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning
