Designing Inherently Interpretable Machine Learning Models

Agus Sudjianto; Aijun Zhang

arXiv:2111.01743·cs.LG·November 3, 2021·20 cites

Designing Inherently Interpretable Machine Learning Models

Agus Sudjianto, Aijun Zhang

PDF

Open Access 1 Repo

TL;DR

This paper discusses the design principles and practical approaches for creating inherently interpretable machine learning models, emphasizing transparency and regulatory compliance in high-stakes industries.

Contribution

It introduces a qualitative template for assessing interpretability and provides design principles with real-world examples for developing high-performance, inherently interpretable models.

Findings

01

Proposed a qualitative template for interpretability assessment

02

Reviewed recent works on interpretable models like ExNN, GAMI-Net, SIMTree

03

Demonstrated design of an interpretable ReLU DNN for credit default prediction

Abstract

Interpretable machine learning (IML) becomes increasingly important in highly regulated industry sectors related to the health and safety or fundamental rights of human beings. In general, the inherently IML models should be adopted because of their transparency and explainability, while black-box models with model-agnostic explainability can be more difficult to defend under regulatory scrutiny. For assessing inherent interpretability of a machine learning model, we propose a qualitative template based on feature effects and model architecture constraints. It provides the design principles for high-performance IML model development, with examples given by reviewing our recent works on ExNN, GAMI-Net, SIMTree, and the Aletheia toolkit for local linear interpretability of deep ReLU networks. We further demonstrate how to design an interpretable ReLU DNN model with evaluation of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

selfexplainml/piml-toolbox
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare