MILE: Model-based Intervention Learning

Yigit Korkmaz; Erdem B{\i}y{\i}k

arXiv:2502.13519·cs.RO·February 21, 2025

MILE: Model-based Intervention Learning

Yigit Korkmaz, Erdem B{\i}y{\i}k

PDF

Open Access

TL;DR

MILE introduces a model-based approach to leverage expert interventions and feedback signals in imitation learning, enabling effective policy learning with minimal interventions across diverse environments.

Contribution

The paper presents a novel method that utilizes both intervention data and feedback signals to learn policies efficiently with few expert interventions.

Findings

01

Effective in discrete and continuous environments

02

Successful real-world robotic manipulation application

03

Validated with human subject study

Abstract

Imitation learning techniques have been shown to be highly effective in real-world control scenarios, such as robotics. However, these approaches not only suffer from compounding error issues but also require human experts to provide complete trajectories. Although there exist interactive methods where an expert oversees the robot and intervenes if needed, these extensions usually only utilize the data collected during intervention periods and ignore the feedback signal hidden in non-intervention timesteps. In this work, we create a model to formulate how the interventions occur in such cases, and show that it is possible to learn a policy with just a handful of expert interventions. Our key insight is that it is possible to get crucial information about the quality of the current state and the optimality of the chosen action from expert feedback, regardless of the presence or the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational Assessment and Improvement