Distill-and-Compare: Auditing Black-Box Models Using Transparent Model   Distillation

Sarah Tan; Rich Caruana; Giles Hooker; Yin Lou

arXiv:1710.06169·stat.ML·October 12, 2018·24 cites

Distill-and-Compare: Auditing Black-Box Models Using Transparent Model Distillation

Sarah Tan, Rich Caruana, Giles Hooker, Yin Lou

PDF

Open Access 1 Repo

TL;DR

This paper introduces Distill-and-Compare, a method for auditing black-box risk models by training transparent models to mimic and compare against them, revealing insights without probing the proprietary APIs.

Contribution

The paper presents a novel approach combining model distillation and comparison to audit black-box models without API access, and introduces a statistical test for missing key features.

Findings

01

Successfully applied to four public datasets.

02

Identified missing key features in ProPublica's COMPAS data.

03

Provides a practical tool for auditing proprietary risk models.

Abstract

Black-box risk scoring models permeate our lives, yet are typically proprietary or opaque. We propose Distill-and-Compare, a model distillation and comparison approach to audit such models. To gain insight into black-box models, we treat them as teachers, training transparent student models to mimic the risk scores assigned by black-box models. We compare the student model trained with distillation to a second un-distilled transparent model trained on ground-truth outcomes, and use differences between the two models to gain insight into the black-box model. Our approach can be applied in a realistic setting, without probing the black-box model API. We demonstrate the approach on four public data sets: COMPAS, Stop-and-Frisk, Chicago Police, and Lending Club. We also propose a statistical test to determine if a data set is missing key features used to train the black-box model. Our test…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shftan/auditblackbox
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI