Sample-Efficient Expert Query Control in Active Imitation Learning via Conformal Prediction
Arad Firouzkouhi (1), Omid Mirzaeedodangeh (2), Lars Lindemann (2) ((1) University of Southern California, (2) ETH Z\"urich)

TL;DR
This paper introduces CRSAIL, a novel active imitation learning method that reduces expert queries by selectively requesting expert actions only for under-represented states, using conformal prediction for robust, task-agnostic state novelty detection.
Contribution
CRSAIL is the first querying rule that leverages conformal prediction to adaptively control expert queries based on state novelty, improving efficiency and robustness in active imitation learning.
Findings
CRSAIL reduces expert queries by up to 96% compared to DAgger.
CRSAIL matches or exceeds expert-level reward in MuJoCo tasks.
CRSAIL is robust to parameter choices and applicable to unknown dynamics.
Abstract
Active imitation learning (AIL) combats covariate shift by querying an expert during training. However, expert action labeling often dominates the cost, especially in GPU-intensive simulators, human-in-the-loop settings, and robot fleets that revisit near-duplicate states. We present Conformalized Rejection Sampling for Active Imitation Learning (CRSAIL), a querying rule that requests an expert action only when the visited state is under-represented in the expert-labeled dataset. CRSAIL scores state novelty by the distance to the -th nearest expert state and sets a single global threshold via conformal prediction. This threshold is the empirical quantile of on-policy calibration scores, providing a distribution-free calibration rule that links to the expected query rate and makes a task-agnostic tuning knob. This state-space querying strategy is robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Machine Learning and Algorithms
