Adversarial Feature Training for Generalizable Robotic Visuomotor Control
Xi Chen, Ali Ghadirzadeh, M{\aa}rten Bj\"orkman, Patric Jensfelt

TL;DR
This paper introduces an adversarial training method that enables reinforcement learning-based visuomotor policies to generalize across different task domains in robotics, reducing the need for extensive data collection.
Contribution
It presents a novel adversarial domain transfer approach that allows RL-trained visuomotor policies to adapt to new environments using only still images.
Findings
Outperforms prior methods in robotic picking and pouring tasks
Enables policy transfer with minimal data from target domain
Demonstrates improved generalization to novel objects and viewpoints
Abstract
Deep reinforcement learning (RL) has enabled training action-selection policies, end-to-end, by learning a function which maps image pixels to action outputs. However, it's application to visuomotor robotic policy training has been limited because of the challenge of large-scale data collection when working with physical hardware. A suitable visuomotor policy should perform well not just for the task-setup it has been trained for, but also for all varieties of the task, including novel objects at different viewpoints surrounded by task-irrelevant objects. However, it is impractical for a robotic setup to sufficiently collect interactive samples in a RL framework to generalize well to novel aspects of a task. In this work, we demonstrate that by using adversarial training for domain transfer, it is possible to train visuomotor policies based on RL frameworks, and then transfer the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics
