A Tensor Low-Rank Approximation for Value Functions in Multi-Task Reinforcement Learning

Sergio Rozada; Santiago Paternain; Juan Andres Bazerque; Antonio G. Marques

arXiv:2501.10529·cs.LG·January 22, 2025

Open Access

TL;DR

This paper introduces a low-rank tensor approximation method for multi-task reinforcement learning that captures task similarities implicitly, reducing data needs and improving learning efficiency in diverse environments.

Contribution

It proposes a novel low-rank tensor modeling approach for multi-task Q-functions, enabling implicit task similarity inference without explicit task grouping.

Findings

01. Effective in a benchmark inverted-pendulum environment

02. Successful application to wireless communication devices

03. Reduces data requirements for multi-task learning

Abstract

In pursuit of reinforcement learning systems that could train in physical environments, we investigate multi-task approaches as a means to alleviate the need for massive data acquisition. In a tabular scenario where the Q-functions are collected across tasks, we model our learning problem as optimizing a higher-order tensor structure. Recognizing that closely related tasks may require similar actions, our proposed method imposes a low-rank condition on this aggregated Q-tensor. The rationale behind this approach to multi-task learning is that the low-rank structure enforces the notion of similarity without the need to explicitly prescribe which tasks are similar; instead, this information is inferred from a reduced amount of data simultaneously with the stochastic optimization of the Q-tensor. The efficiency of our low-rank tensor approach to multi-task learning is demonstrated in two numerical…
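To make the idea of a low-rank aggregated Q-tensor concrete, the following is a minimal sketch, not the authors' implementation. It assumes a CP (PARAFAC) factorization with one factor matrix per mode (task, state, action) and a plain stochastic temporal-difference step on the factors; the abstract does not specify the factorization or the update rule. The class name `LowRankQTensor`, its methods, and all hyperparameters are hypothetical.

```python
import numpy as np


class LowRankQTensor:
    """Hypothetical sketch: Q[task, state, action] is approximated by
    sum_r T[task, r] * S[state, r] * A[action, r] (a rank-`rank` CP model)."""

    def __init__(self, n_tasks, n_states, n_actions, rank, scale=0.1, seed=0):
        rng = np.random.default_rng(seed)
        # One factor matrix per tensor mode; the task factors are shared
        # with every state and action, which is what couples the tasks.
        self.T = scale * rng.standard_normal((n_tasks, rank))
        self.S = scale * rng.standard_normal((n_states, rank))
        self.A = scale * rng.standard_normal((n_actions, rank))

    def q_values(self, task, state):
        # Q-values of all actions for one (task, state) pair.
        return self.A @ (self.T[task] * self.S[state])

    def update(self, task, state, action, reward, next_state, done,
               lr=0.05, gamma=0.9):
        # Q-learning style target (an assumption of this sketch).
        if done:
            target = reward
        else:
            target = reward + gamma * self.q_values(task, next_state).max()
        td_error = target - self.q_values(task, state)[action]

        # Copy the rows first so all three gradients use the old factors.
        t = self.T[task].copy()
        s = self.S[state].copy()
        a = self.A[action].copy()

        # Stochastic gradient step on 0.5 * td_error**2 w.r.t. each factor row,
        # treating the target as a constant.
        self.T[task] += lr * td_error * s * a
        self.S[state] += lr * td_error * t * a
        self.A[action] += lr * td_error * t * s
        return td_error  # error measured before this update

    def n_parameters(self):
        return self.T.size + self.S.size + self.A.size
```

In this sketch, 4 tasks with 50 states and 3 actions at rank 2 need (4 + 50 + 3) x 2 = 114 parameters, compared with 4 x 50 x 3 = 600 entries for separate per-task Q-tables. Because the state and action factors are shared, a sample collected in one task also moves the estimates of the others, which is one way the low-rank condition could encode task similarity without naming similar tasks in advance.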

Taxonomy

Topics: Reinforcement Learning in Robotics