Adversarial Imitation Learning from Incomplete Demonstrations
Mingfei Sun, Xiaojuan Ma

TL;DR
This paper introduces AGAIL, a novel imitation learning algorithm that effectively learns policies from incomplete demonstrations by leveraging available action information as guidance.
Contribution
AGAIL is the first method to incorporate partial action sequences into adversarial imitation learning, improving policy learning from incomplete demonstrations.
Findings
AGAIL performs comparably to state-of-the-art methods on benchmarks.
AGAIL effectively utilizes partial action information for policy training.
The method is robust to incomplete demonstration data.
Abstract
Imitation learning targets deriving a mapping from states to actions, a.k.a. policy, from expert demonstrations. Existing methods for imitation learning typically require any actions in the demonstrations to be fully available, which is hard to ensure in real applications. Though algorithms for learning with unobservable actions have been proposed, they focus solely on state information and overlook the fact that the action sequence could still be partially available and provide useful information for policy deriving. In this paper, we propose a novel algorithm called Action-Guided Adversarial Imitation Learning (AGAIL) that learns a policy from demonstrations with incomplete action sequences, i.e., incomplete demonstrations. The core idea of AGAIL is to separate demonstrations into state and action trajectories, and train a policy with state trajectories while using actions as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Reinforcement Learning in Robotics · Multimodal Machine Learning Applications
