Robot Data Curation with Mutual Information Estimators
Joey Hejna, Suvir Mirchandani, Ashwin Balakrishna, Annie Xie, Ayzaan, Wahid, Jonathan Tompson, Pannag Sanketi, Dhruv Shah, Coline Devin, Dorsa, Sadigh

TL;DR
This paper introduces a novel mutual information estimator using k-nearest neighbors and VAE embeddings to assess and improve the quality of demonstration datasets for robotic imitation learning, leading to better policy performance.
Contribution
The paper presents a new mutual information estimation technique tailored for robotics data, enabling quality assessment and filtering of demonstrations to enhance learning outcomes.
Findings
Effective dataset partitioning by demonstration quality
Improved policy performance with data filtering
Applicable to both simulation and real-world environments
Abstract
The performance of imitation learning policies often hinges on the datasets with which they are trained. Consequently, investment in data collection for robotics has grown across both industrial and academic labs. However, despite the marked increase in the quantity of demonstrations collected, little work has sought to assess the quality of said data despite mounting evidence of its importance in other areas such as vision and language. In this work, we take a critical step towards addressing the data quality in robotics. Given a dataset of demonstrations, we aim to estimate the relative quality of individual demonstrations in terms of both action diversity and predictability. To do so, we estimate the average contribution of a trajectory towards the mutual information between states and actions in the entire dataset, which captures both the entropy of the marginal action distribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Scientific Computing and Data Management · Data Quality and Management
