Hard to Explain: On the Computational Hardness of In-Distribution Model Interpretation
Guy Amir, Shahaf Bassan, Guy Katz

TL;DR
This paper explores how the underlying data distribution impacts the computational complexity of interpreting machine learning models, emphasizing the importance of distribution-aware explanations for social alignment and interpretability.
Contribution
It introduces the consideration of data distribution as a key factor in interpretability complexity, extending prior models to include distribution effects for more socially aligned explanations.
Findings
Distribution significantly affects interpretation complexity.
Models with distribution-aware explanations can improve social alignment.
Essential prerequisites for socially aligned explanations are identified.
Abstract
The ability to interpret Machine Learning (ML) models is becoming increasingly essential. However, despite significant progress in the field, there remains a lack of rigorous characterization regarding the innate interpretability of different models. In an attempt to bridge this gap, recent work has demonstrated that it is possible to formally assess interpretability by studying the computational complexity of explaining the decisions of various models. In this setting, if explanations for a particular model can be obtained efficiently, the model is considered interpretable (since it can be explained ``easily''). However, if generating explanations over an ML model is computationally intractable, it is considered uninterpretable. Prior research identified two key factors that influence the complexity of interpreting an ML model: (i) the type of the model (e.g., neural networks, decision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Data Analysis with R · Time Series Analysis and Forecasting
