Robust Intervention Learning from Emergency Stop Interventions
Ethan Pronovost, Khimya Khetarpal, Siddhartha Srinivasa

TL;DR
This paper introduces a robust method for learning from emergency stop interventions in autonomous systems, effectively handling noisy and incomplete intervention data to improve policy performance.
Contribution
It proposes Residual Intervention Fine-Tuning (RIFT), a novel residual fine-tuning algorithm that combines intervention feedback with prior policies for robust learning.
Findings
RIFT enables consistent policy improvement across various intervention strategies.
Theoretical analysis clarifies when intervention learning guarantees policy enhancement.
Experimental results demonstrate robustness to intervention noise and incompleteness.
Abstract
Human interventions are a common source of data in autonomous systems during testing. These interventions provide an important signal about where the current policy needs improvement, but are often noisy and incomplete. We define Robust Intervention Learning (RIL) as the problem of learning from intervention data while remaining robust to the quality and informativeness of the intervention signal. In the best case, interventions are precise and avoiding them is sufficient to solve the task, but in many realistic settings avoiding interventions is necessary but not sufficient for achieving good performance. We study robust intervention learning in the context of emergency stop interventions and propose Residual Intervention Fine-Tuning (RIFT), a residual fine-tuning algorithm that treats intervention feedback as an incomplete learning signal and explicitly combines it with a prior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
