QPM: Discrete Optimization for Globally Interpretable Image Classification
Thomas Norrenbrock, Timo Kaiser, Sovan Biswas, Ramesh Manuvinakurike,, Bodo Rosenhahn

TL;DR
QPM introduces a novel discrete optimization approach to learn globally interpretable class representations in deep neural networks, enhancing understanding of model behavior while maintaining high accuracy across various datasets.
Contribution
The paper presents QPM, a method that learns compact, contrastive class representations using discrete optimization, enabling global interpretability in image classification models.
Findings
QPM achieves state-of-the-art interpretability on multiple datasets.
QPM maintains high classification accuracy comparable to non-interpretable models.
QPM provides clear, contrastive class representations that are easy to compare.
Abstract
Understanding the classifications of deep neural networks, e.g. used in safety-critical situations, is becoming increasingly important. While recent models can locally explain a single decision, to provide a faithful global explanation about an accurate model's general behavior is a more challenging open task. Towards that goal, we introduce the Quadratic Programming Enhanced Model (QPM), which learns globally interpretable class representations. QPM represents every class with a binary assignment of very few, typically 5, features, that are also assigned to other classes, ensuring easily comparable contrastive class representations. This compact binary assignment is found using discrete optimization based on predefined similarity measures and interpretability constraints. The resulting optimal assignment is used to fine-tune the diverse features, so that each of them becomes the shared…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
