Generative adversarial training of product of policies for robust and   adaptive movement primitives

Emmanuel Pignat; Hakan Girgin; Sylvain Calinon

arXiv:2011.03316·cs.RO·November 9, 2020

Generative adversarial training of product of policies for robust and adaptive movement primitives

Emmanuel Pignat, Hakan Girgin, Sylvain Calinon

PDF

Open Access

TL;DR

This paper introduces a generative adversarial training method for product of policies to improve robustness and adaptability in learning movement primitives from demonstrations, addressing dependencies often ignored in simpler models.

Contribution

It proposes using approximate trajectory distributions as discriminators within a GAN framework to enhance learning stability and speed, while incorporating product of Gaussian policies and ensemble methods for robustness.

Findings

01

Validated on a 7-DoF manipulator

02

Improved adaptability to varying contexts

03

Enhanced robustness to perturbations

Abstract

In learning from demonstrations, many generative models of trajectories make simplifying assumptions of independence. Correctness is sacrificed in the name of tractability and speed of the learning phase. The ignored dependencies, which often are the kinematic and dynamic constraints of the system, are then only restored when synthesizing the motion, which introduces possibly heavy distortions. In this work, we propose to use those approximate trajectory distributions as close-to-optimal discriminators in the popular generative adversarial framework to stabilize and accelerate the learning procedure. The two problems of adaptability and robustness are addressed with our method. In order to adapt the motions to varying contexts, we propose to use a product of Gaussian policies defined in several parametrized task spaces. Robustness to perturbations and varying dynamics is ensured…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics