Generative adversarial training of product of policies for robust and adaptive movement primitives
Emmanuel Pignat, Hakan Girgin, Sylvain Calinon

TL;DR
This paper introduces a generative adversarial training method for product of policies to improve robustness and adaptability in learning movement primitives from demonstrations, addressing dependencies often ignored in simpler models.
Contribution
It proposes using approximate trajectory distributions as discriminators within a GAN framework to enhance learning stability and speed, while incorporating product of Gaussian policies and ensemble methods for robustness.
Findings
Validated on a 7-DoF manipulator
Improved adaptability to varying contexts
Enhanced robustness to perturbations
Abstract
In learning from demonstrations, many generative models of trajectories make simplifying assumptions of independence. Correctness is sacrificed in the name of tractability and speed of the learning phase. The ignored dependencies, which often are the kinematic and dynamic constraints of the system, are then only restored when synthesizing the motion, which introduces possibly heavy distortions. In this work, we propose to use those approximate trajectory distributions as close-to-optimal discriminators in the popular generative adversarial framework to stabilize and accelerate the learning procedure. The two problems of adaptability and robustness are addressed with our method. In order to adapt the motions to varying contexts, we propose to use a product of Gaussian policies defined in several parametrized task spaces. Robustness to perturbations and varying dynamics is ensured…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics
