Designing Inherently Interpretable Machine Learning Models
Agus Sudjianto, Aijun Zhang

TL;DR
This paper discusses the design principles and practical approaches for creating inherently interpretable machine learning models, emphasizing transparency and regulatory compliance in high-stakes industries.
Contribution
It introduces a qualitative template for assessing interpretability and provides design principles with real-world examples for developing high-performance, inherently interpretable models.
Findings
Proposed a qualitative template for interpretability assessment
Reviewed recent works on interpretable models like ExNN, GAMI-Net, SIMTree
Demonstrated design of an interpretable ReLU DNN for credit default prediction
Abstract
Interpretable machine learning (IML) becomes increasingly important in highly regulated industry sectors related to the health and safety or fundamental rights of human beings. In general, the inherently IML models should be adopted because of their transparency and explainability, while black-box models with model-agnostic explainability can be more difficult to defend under regulatory scrutiny. For assessing inherent interpretability of a machine learning model, we propose a qualitative template based on feature effects and model architecture constraints. It provides the design principles for high-performance IML model development, with examples given by reviewing our recent works on ExNN, GAMI-Net, SIMTree, and the Aletheia toolkit for local linear interpretability of deep ReLU networks. We further demonstrate how to design an interpretable ReLU DNN model with evaluation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare
