A Dual Approach to Imitation Learning from Observations with Offline Datasets
Harshit Sikchi, Caleb Chuck, Amy Zhang, Scott Niekum

TL;DR
This paper introduces DILO, a novel offline imitation learning algorithm that directly learns a multi-step utility function from observation-only data, avoiding intermediate models and scaling effectively to complex, high-dimensional observations.
Contribution
DILO leverages duality to learn policies from observation data without expert actions, simplifying the process and improving scalability and performance over existing methods.
Findings
DILO outperforms existing imitation learning methods on various benchmarks.
It effectively handles high-dimensional observations.
The approach scales well with complex robot morphologies.
Abstract
Demonstrations are an effective alternative to task specification for learning agents in settings where designing a reward function is difficult. However, demonstrating expert behavior in the action space of the agent becomes unwieldy when robots have complex, unintuitive morphologies. We consider the practical setting where an agent has a dataset of prior interactions with the environment and is provided with observation-only expert demonstrations. Typical learning from observations approaches have required either learning an inverse dynamics model or a discriminator as intermediate steps of training. Errors in these intermediate one-step models compound during downstream policy learning or deployment. We overcome these limitations by directly learning a multi-step utility function that quantifies how each action impacts the agent's divergence from the expert's visitation distribution.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Human Pose and Action Recognition · Machine Learning and Data Classification
