TL;DR
GLocalX is a novel explanation method that hierarchically aggregates local explanations to produce accurate, simple, and interpretable global models of black box AI, enhancing transparency in high-stakes domains.
Contribution
It introduces GLocalX, a local-first, model-agnostic explanation technique that generalizes local decision rules into global explanations, often replacing complex models with simpler, interpretable ones.
Findings
GLocalX accurately emulates complex models with simple models.
It achieves state-of-the-art performance compared to global solutions.
High accuracy and interpretability are possible without trade-offs.
Abstract
Artificial Intelligence (AI) has come to prominence as one of the major components of our society, with applications in most aspects of our lives. In this field, complex and highly nonlinear machine learning models such as ensemble models, deep neural networks, and Support Vector Machines have consistently shown remarkable accuracy in solving complex tasks. Although accurate, AI models often are "black boxes" which we are not able to understand. Relying on these models has a multifaceted impact and raises significant concerns about their transparency. Applications in sensitive and critical domains are a strong motivational factor in trying to understand the behavior of black boxes. We propose to address this issue by providing an interpretable layer on top of black box models by aggregating "local" explanations. We present GLocalX, a "local-first" model agnostic explanation method.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
