Sample-Efficient Expert Query Control in Active Imitation Learning via Conformal Prediction

Arad Firouzkouhi (1); Omid Mirzaeedodangeh (2); Lars Lindemann (2) ((1) University of Southern California; (2) ETH Z\"urich)

arXiv:2512.00453·cs.RO·December 2, 2025

Sample-Efficient Expert Query Control in Active Imitation Learning via Conformal Prediction

Arad Firouzkouhi (1), Omid Mirzaeedodangeh (2), Lars Lindemann (2) ((1) University of Southern California, (2) ETH Z\"urich)

PDF

Open Access

TL;DR

This paper introduces CRSAIL, a novel active imitation learning method that reduces expert queries by selectively requesting expert actions only for under-represented states, using conformal prediction for robust, task-agnostic state novelty detection.

Contribution

CRSAIL is the first querying rule that leverages conformal prediction to adaptively control expert queries based on state novelty, improving efficiency and robustness in active imitation learning.

Findings

01

CRSAIL reduces expert queries by up to 96% compared to DAgger.

02

CRSAIL matches or exceeds expert-level reward in MuJoCo tasks.

03

CRSAIL is robust to parameter choices and applicable to unknown dynamics.

Abstract

Active imitation learning (AIL) combats covariate shift by querying an expert during training. However, expert action labeling often dominates the cost, especially in GPU-intensive simulators, human-in-the-loop settings, and robot fleets that revisit near-duplicate states. We present Conformalized Rejection Sampling for Active Imitation Learning (CRSAIL), a querying rule that requests an expert action only when the visited state is under-represented in the expert-labeled dataset. CRSAIL scores state novelty by the distance to the $K$ -th nearest expert state and sets a single global threshold via conformal prediction. This threshold is the empirical $(1 - α)$ quantile of on-policy calibration scores, providing a distribution-free calibration rule that links $α$ to the expected query rate and makes $α$ a task-agnostic tuning knob. This state-space querying strategy is robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Machine Learning and Algorithms