Provable Multi-Task Reinforcement Learning: A Representation Learning Framework with Low Rank Rewards
Yaoze Guo, Shana Moothedath

TL;DR
This paper introduces a low-rank reward matrix estimation framework for multi-task reinforcement learning, enabling shared representation learning and near-optimal policy derivation under relaxed assumptions.
Contribution
It proposes a novel low-rank matrix estimation approach for multi-task RL that relaxes previous restrictive assumptions and provides theoretical guarantees.
Findings
Accurate low-rank reward matrix recovery under general feature distributions.
Theoretical bounds on sample complexity and representation error.
Experimental validation of shared representation learning from finite data.
Abstract
Multi-task representation learning (MTRL) is an approach that learns shared latent representations across related tasks, facilitating collaborative learning that improves the overall learning efficiency. This paper studies MTRL for multi-task reinforcement learning (RL), where multiple tasks have the same state-action space and transition probabilities, but different rewards. We consider T linear Markov Decision Processes (MDPs) where the reward functions and transition dynamics admit linear feature embeddings of dimension d. The relatedness among the tasks is captured by a low-rank structure on the reward matrices. Learning shared representations across multiple RL tasks is challenging due to the complex and policy-dependent nature of data that leads to a temporal progression of error. Our approach adopts a reward-free reinforcement learning framework to first learn a data-collection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
