TL;DR
This paper introduces an 'arguing machines' framework where two independent AI systems collaborate with human oversight to improve decision accuracy in life-critical tasks, demonstrated in image classification and autonomous driving.
Contribution
The paper proposes a novel framework of paired AI systems that use disagreement to enhance accuracy with human supervision, applicable to real-world safety-critical systems.
Findings
Reduced ImageNet top-5 error from 8.0% to 2.8%.
Predicted 90.4% of challenging Tesla Autopilot disengagements.
Demonstrated effectiveness in both image classification and autonomous driving.
Abstract
We consider the paradigm of a black box AI system that makes life-critical decisions. We propose an "arguing machines" framework that pairs the primary AI system with a secondary one that is independently trained to perform the same task. We show that disagreement between the two systems, without any knowledge of underlying system design or operation, is sufficient to arbitrarily improve the accuracy of the overall decision pipeline given human supervision over disagreements. We demonstrate this system in two applications: (1) an illustrative example of image classification and (2) on large-scale real-world semi-autonomous driving data. For the first application, we apply this framework to image classification achieving a reduction from 8.0% to 2.8% top-5 error on ImageNet. For the second application, we apply this framework to Tesla Autopilot and demonstrate the ability to predict…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
