Toward Robust Long Range Policy Transfer

Wei-Cheng Tseng; Jin-Siang Lin; Yao-Min Feng; Min Sun

arXiv:2103.02957·cs.LG·March 5, 2021

Toward Robust Long Range Policy Transfer

Wei-Cheng Tseng, Jin-Siang Lin, Yao-Min Feng, Min Sun

PDF

Open Access 1 Video

TL;DR

This paper introduces a hierarchical policy transfer method that enhances the diversity and adaptability of primitive policies, enabling more effective transfer to complex new tasks with continuous actions.

Contribution

It proposes a novel approach that trains a combination function and adapts primitive policies, with regularization to improve diversity, outperforming existing transfer methods.

Findings

01

Outperforms recent policy transfer methods in continuous action tasks.

02

Provides broader transferability range compared to prior approaches.

03

Regularization terms are crucial for effective long-range transfer.

Abstract

Humans can master a new task within a few trials by drawing upon skills acquired through prior experience. To mimic this capability, hierarchical models combining primitive policies learned from prior tasks have been proposed. However, these methods fall short comparing to the human's range of transferability. We propose a method, which leverages the hierarchical structure to train the combination function and adapt the set of diverse primitive polices alternatively, to efficiently produce a range of complex behaviors on challenging new tasks. We also design two regularization terms to improve the diversity and utilization rate of the primitives in the pre-training phase. We demonstrate that our method outperforms other recent policy transfer methods by combining and adapting these reusable primitives in tasks with continuous action space. The experiment results further show that our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Toward Robust Long Range Policy Transfer· underline

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Reinforcement Learning in Robotics