Supersparse Linear Integer Models for Interpretable Classification

Berk Ustun; Stefano Trac\`a; Cynthia Rudin

arXiv:1306.6677·stat.ML·April 14, 2014·25 cites

Supersparse Linear Integer Models for Interpretable Classification

Berk Ustun, Stefano Trac\`a, Cynthia Rudin

PDF

Open Access

TL;DR

This paper introduces SLIM, a tool for creating highly accurate, sparse, and interpretable scoring systems for classification, optimized through discrete methods for practical use in fields like medicine and criminology.

Contribution

The paper presents SLIM, a novel discrete optimization approach for generating scoring systems that are both accurate and highly interpretable, with demonstrated practical applications.

Findings

01

SLIM produces scoring systems that are both accurate and sparse.

02

SLIM outperforms state-of-the-art models in interpretability and accuracy.

03

Applications in medicine and criminology show practical utility.

Abstract

Scoring systems are classification models that only require users to add, subtract and multiply a few meaningful numbers to make a prediction. These models are often used because they are practical and interpretable. In this paper, we introduce an off-the-shelf tool to create scoring systems that both accurate and interpretable, known as a Supersparse Linear Integer Model (SLIM). SLIM is a discrete optimization problem that minimizes the 0-1 loss to encourage a high level of accuracy, regularizes the L0-norm to encourage a high level of sparsity, and constrains coefficients to a set of interpretable values. We illustrate the practical and interpretable nature of SLIM scoring systems through applications in medicine and criminology, and show that they are are accurate and sparse in comparison to state-of-the-art classification models using numerical experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Imbalanced Data Classification Techniques · Adversarial Robustness in Machine Learning