Finding Label and Model Errors in Perception Data With Learned Observation Assertions
Daniel Kang, Nikos Arechiga, Sudeep Pillai, Peter Bailis, Matei, Zaharia

TL;DR
This paper introduces Fixy, a system that learns to identify errors in labeled perception data by modeling feature distributions, significantly improving error detection precision in autonomous vehicle datasets.
Contribution
The paper presents a novel abstraction called learned observation assertions and implements it in Fixy, which leverages existing data and models to detect label errors more accurately.
Findings
Fixy achieves up to 2× higher precision in error ranking.
Fixy effectively utilizes existing resources to learn feature distributions.
The approach outperforms recent model assertion techniques and uncertainty sampling.
Abstract
ML is being deployed in complex, real-world scenarios where errors have impactful consequences. In these systems, thorough testing of the ML pipelines is critical. A key component in ML deployment pipelines is the curation of labeled training data. Common practice in the ML literature assumes that labels are the ground truth. However, in our experience in a large autonomous vehicle development center, we have found that vendors can often provide erroneous labels, which can lead to downstream safety risks in trained models. To address these issues, we propose a new abstraction, learned observation assertions, and implement it in a system called Fixy. Fixy leverages existing organizational resources, such as existing (possibly noisy) labeled datasets or previously trained ML models, to learn a probabilistic model for finding errors in human- or model-generated labels. Given…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Machine Learning and Data Classification · Autonomous Vehicle Technology and Safety
