Provable Multi-Task Reinforcement Learning: A Representation Learning Framework with Low Rank Rewards

Yaoze Guo; Shana Moothedath

arXiv:2604.03891·cs.LG·April 7, 2026

Provable Multi-Task Reinforcement Learning: A Representation Learning Framework with Low Rank Rewards

Yaoze Guo, Shana Moothedath

PDF

TL;DR

This paper introduces a low-rank reward matrix estimation framework for multi-task reinforcement learning, enabling shared representation learning and near-optimal policy derivation under relaxed assumptions.

Contribution

It proposes a novel low-rank matrix estimation approach for multi-task RL that relaxes previous restrictive assumptions and provides theoretical guarantees.

Findings

01

Accurate low-rank reward matrix recovery under general feature distributions.

02

Theoretical bounds on sample complexity and representation error.

03

Experimental validation of shared representation learning from finite data.

Abstract

Multi-task representation learning (MTRL) is an approach that learns shared latent representations across related tasks, facilitating collaborative learning that improves the overall learning efficiency. This paper studies MTRL for multi-task reinforcement learning (RL), where multiple tasks have the same state-action space and transition probabilities, but different rewards. We consider T linear Markov Decision Processes (MDPs) where the reward functions and transition dynamics admit linear feature embeddings of dimension d. The relatedness among the tasks is captured by a low-rank structure on the reward matrices. Learning shared representations across multiple RL tasks is challenging due to the complex and policy-dependent nature of data that leads to a temporal progression of error. Our approach adopts a reward-free reinforcement learning framework to first learn a data-collection…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.