Skills Regularized Task Decomposition for Multi-task Offline   Reinforcement Learning

Minjong Yoo; Sangwoo Cho; Honguk Woo

arXiv:2408.15593·cs.LG·August 29, 2024·3 cites

Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning

Minjong Yoo, Sangwoo Cho, Honguk Woo

PDF

Open Access 1 Video

TL;DR

This paper introduces a skill-based multi-task offline reinforcement learning method that decomposes tasks into shared subtasks using a Wasserstein auto-encoder, effectively leveraging heterogeneous datasets of varying quality.

Contribution

It proposes a novel task decomposition technique with quality-weighted regularization and dataset augmentation to improve multi-task offline RL performance on complex robotic and navigation tasks.

Findings

01

Outperforms state-of-the-art algorithms on robotic manipulation tasks

02

Robust to datasets with mixed quality levels

03

Enhances policy learning with skill-based dataset augmentation

Abstract

Reinforcement learning (RL) with diverse offline datasets can have the advantage of leveraging the relation of multiple tasks and the common skills learned across those tasks, hence allowing us to deal with real-world complex problems efficiently in a data-driven way. In offline RL where only offline data is used and online interaction with the environment is restricted, it is yet difficult to achieve the optimal policy for multiple tasks, especially when the data quality varies for the tasks. In this paper, we present a skill-based multi-task RL technique on heterogeneous datasets that are generated by behavior policies of different quality. To learn the shareable knowledge across those datasets effectively, we employ a task decomposition method for which common skills are jointly learned and used as guidance to reformulate a task in shared and achievable subtasks. In this joint…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics