Active Imitation Learning via Reduction to I.I.D. Active Learning
Kshitij Judah, Alan Fern, Thomas G. Dietterich

TL;DR
This paper presents a new active imitation learning method that reduces the problem to i.i.d. active learning, significantly decreasing expert queries needed for policy learning and demonstrating effectiveness across multiple domains.
Contribution
It introduces a reduction-based approach to active imitation learning leveraging i.i.d. active learning techniques, with theoretical analysis and a practical algorithm.
Findings
Reduced label complexity compared to passive learning
Effective performance in four test domains
Theoretical bounds for non-stationary and stationary policies
Abstract
In standard passive imitation learning, the goal is to learn a target policy by passively observing full execution trajectories of it. Unfortunately, generating such trajectories can require substantial expert effort and be impractical in some cases. In this paper, we consider active imitation learning with the goal of reducing this effort by querying the expert about the desired action at individual states, which are selected based on answers to past queries and the learner's interactions with an environment simulator. We introduce a new approach based on reducing active imitation learning to i.i.d. active learning, which can leverage progress in the i.i.d. setting. Our first contribution, is to analyze reductions for both non-stationary and stationary policies, showing that the label complexity (number of queries) of active imitation learning can be substantially less than passive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Optimization and Search Problems · Reinforcement Learning in Robotics
