SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models

Shenghua Wan; Yucen Wang; Minghao Shao; Ruying Chen; De-Chuan Zhan

arXiv:2306.10695·cs.LG·June 21, 2023·1 cites

SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models

Shenghua Wan, Yucen Wang, Minghao Shao, Ruying Chen, De-Chuan Zhan

PDF

Open Access 1 Video

TL;DR

SeMAIL is a novel imitation learning algorithm that separates environment dynamics into task-relevant and irrelevant parts, effectively removing distractors and enabling agents to imitate expert behavior efficiently in complex visual environments.

Contribution

The paper introduces SeMAIL, a new method that decouples environment dynamics into relevant and irrelevant components, improving imitation learning in high-dimensional visual tasks.

Findings

01

Achieves near-expert performance on complex visual control tasks.

02

Effectively removes distractors like moving backgrounds.

03

Improves sample efficiency in visual imitation learning.

Abstract

Model-based imitation learning (MBIL) is a popular reinforcement learning method that improves sample efficiency on high-dimension input sources, such as images and videos. Following the convention of MBIL research, existing algorithms are highly deceptive by task-irrelevant information, especially moving distractors in videos. To tackle this problem, we propose a new algorithm - named Separated Model-based Adversarial Imitation Learning (SeMAIL) - decoupling the environment dynamics into two parts by task-relevant dependency, which is determined by agent actions, and training separately. In this way, the agent can imagine its trajectories and imitate the expert behavior efficiently in task-relevant state space. Our method achieves near-expert performance on various visual control tasks with complex observations and the more challenging tasks with different backgrounds from expert…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models· slideslive

Taxonomy

TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning