Distill-and-Compare: Auditing Black-Box Models Using Transparent Model Distillation
Sarah Tan, Rich Caruana, Giles Hooker, Yin Lou

TL;DR
This paper introduces Distill-and-Compare, a method for auditing black-box risk models by training transparent models to mimic and compare against them, revealing insights without probing the proprietary APIs.
Contribution
The paper presents a novel approach combining model distillation and comparison to audit black-box models without API access, and introduces a statistical test for missing key features.
Findings
Successfully applied to four public datasets.
Identified missing key features in ProPublica's COMPAS data.
Provides a practical tool for auditing proprietary risk models.
Abstract
Black-box risk scoring models permeate our lives, yet are typically proprietary or opaque. We propose Distill-and-Compare, a model distillation and comparison approach to audit such models. To gain insight into black-box models, we treat them as teachers, training transparent student models to mimic the risk scores assigned by black-box models. We compare the student model trained with distillation to a second un-distilled transparent model trained on ground-truth outcomes, and use differences between the two models to gain insight into the black-box model. Our approach can be applied in a realistic setting, without probing the black-box model API. We demonstrate the approach on four public data sets: COMPAS, Stop-and-Frisk, Chicago Police, and Lending Club. We also propose a statistical test to determine if a data set is missing key features used to train the black-box model. Our test…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
