Renaissance Robot: Optimal Transport Policy Fusion for Learning Diverse   Skills

Julia Tan; Ransalu Senanayake; Fabio Ramos

arXiv:2207.00978·cs.LG·July 5, 2022

Renaissance Robot: Optimal Transport Policy Fusion for Learning Diverse Skills

Julia Tan, Ransalu Senanayake, Fabio Ramos

PDF

Open Access 1 Repo

TL;DR

This paper introduces a post-hoc policy fusion method using Optimal Transport to combine knowledge from multiple RL agents, enabling faster learning of new skills in robotics.

Contribution

It proposes a novel Optimal Transport-based policy fusion technique that improves initialization for new tasks, reducing training time and computational resources.

Findings

01

Unified policies enable quicker skill acquisition.

02

Significant reduction in training time for new tasks.

03

Effective knowledge consolidation across diverse agents.

Abstract

Deep reinforcement learning (RL) is a promising approach to solving complex robotics problems. However, the process of learning through trial-and-error interactions is often highly time-consuming, despite recent advancements in RL algorithms. Additionally, the success of RL is critically dependent on how well the reward-shaping function suits the task, which is also time-consuming to design. As agents trained on a variety of robotics problems continue to proliferate, the ability to reuse their valuable learning for new domains becomes increasingly significant. In this paper, we propose a post-hoc technique for policy fusion using Optimal Transport theory as a robust means of consolidating the knowledge of multiple agents that have been trained on distinct scenarios. We further demonstrate that this provides an improved weights initialisation of the neural network policy for learning new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

julia-lina-tan/ot-policy-fusion
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms