CUPID: Curating Data your Robot Loves with Influence Functions
Christopher Agia, Rohan Sinha, Jingyun Yang, Rika Antonova, Marco Pavone, Haruki Nishimura, Masha Itkina, Jeannette Bohg

TL;DR
CUPID introduces a novel influence function-based method for robot imitation learning data curation, enabling effective filtering and selection of training demonstrations to improve policy performance in simulation and real-world tasks.
Contribution
The paper presents CUPID, a new influence function-theoretic approach for identifying impactful training data in robot imitation learning, improving policy outcomes and robustness.
Findings
Training with less than 33% of curated data achieves state-of-the-art results.
CUPID effectively identifies data that enhances test-time performance.
The method improves robustness under distribution shifts and isolates spurious correlations.
Abstract
In robot imitation learning, policy performance is tightly coupled with the quality and composition of the demonstration data. Yet, developing a precise understanding of how individual demonstrations contribute to downstream outcomes - such as closed-loop task success or failure - remains a persistent challenge. We propose CUPID, a robot data curation method based on a novel influence function-theoretic formulation for imitation learning policies. Given a set of evaluation rollouts, CUPID estimates the influence of each training demonstration on the policy's expected return. This enables ranking and selection of demonstrations according to their impact on the policy's closed-loop performance. We use CUPID to curate data by 1) filtering out training demonstrations that harm policy performance and 2) subselecting newly collected trajectories that will most improve the policy. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · AI-based Problem Solving and Planning · Scientific Computing and Data Management
