GLocalX -- From Local to Global Explanations of Black Box AI Models

Mattia Setzu; Riccardo Guidotti; Anna Monreale; Franco Turini; Dino Pedreschi; Fosca Giannotti

arXiv:2101.07685·cs.LG·January 29, 2021

TL;DR

GLocalX is a novel explanation method that hierarchically aggregates local explanations to produce accurate, simple, and interpretable global models of black box AI, enhancing transparency in high-stakes domains.

Contribution

It introduces GLocalX, a local-first, model-agnostic explanation technique that generalizes local decision rules into global explanations, often replacing complex models with simpler, interpretable ones.
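To make the idea of generalizing local decision rules into a global explanation more concrete, here is a minimal sketch of hierarchical rule aggregation. It is an illustration under stated assumptions, not the authors' implementation. The rule representation (feature intervals plus a label), the coverage-based Jaccard similarity, the interval-hull merge in `generalize`, and the `min_similarity` stopping threshold are all simplifications chosen for this example. The merge operator and acceptance criterion that GLocalX actually uses are defined in the paper and are not reproduced here.

```python
from itertools import combinations

# ASSUMPTION: a rule is (premises, label), where premises maps a feature
# name to a half-open interval (low, high). This is a hypothetical encoding.

def covers(rule, x):
    """True if record x satisfies every premise of the rule."""
    premises, _ = rule
    return all(lo <= x[f] < hi for f, (lo, hi) in premises.items())

def coverage(rule, data):
    """Indices of the records in data that the rule covers."""
    return {i for i, x in enumerate(data) if covers(rule, x)}

def jaccard(a, b):
    """Jaccard similarity of two index sets (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def generalize(r1, r2):
    """Merge two same-label rules: keep only the shared features and
    widen each one to the hull of the two intervals (illustrative merge)."""
    (p1, label), (p2, _) = r1, r2
    shared = p1.keys() & p2.keys()
    merged = {f: (min(p1[f][0], p2[f][0]), max(p1[f][1], p2[f][1]))
              for f in shared}
    return (merged, label)

def aggregate(rules, data, min_similarity=0.0):
    """Repeatedly merge the most similar pair of same-label rules until
    no pair has coverage similarity above min_similarity."""
    rules = list(rules)
    while True:
        best = None
        for i, j in combinations(range(len(rules)), 2):
            if rules[i][1] != rules[j][1]:
                continue  # only rules predicting the same label are merged
            s = jaccard(coverage(rules[i], data), coverage(rules[j], data))
            if s > min_similarity and (best is None or s > best[0]):
                best = (s, i, j)
        if best is None:
            return rules
        _, i, j = best
        merged = generalize(rules[i], rules[j])
        rules = [r for k, r in enumerate(rules) if k not in (i, j)] + [merged]

# Toy data and local rules, invented purely for illustration.
example_data = [
    {"age": 20, "income": 10},
    {"age": 30, "income": 50},
    {"age": 40, "income": 60},
    {"age": 70, "income": 20},
]
example_rules = [
    ({"age": (25, 35), "income": (40, 100)}, "approve"),
    ({"age": (25, 45)}, "approve"),
    ({"age": (60, 100)}, "deny"),
]
global_rules = aggregate(example_rules, example_data)
```

In this toy run the two "approve" rules overlap on one record, so they are merged into a single, more general rule on `age`, while the "deny" rule is left alone. A faithful version would also check that each merge preserves fidelity to the black box before accepting it.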

Findings

01 GLocalX accurately emulates complex models with simple models.

02 It achieves state-of-the-art performance compared to global solutions.

03 High accuracy and interpretability are possible without trade-offs.

Abstract

Artificial Intelligence (AI) has come to prominence as one of the major components of our society, with applications in most aspects of our lives. In this field, complex and highly nonlinear machine learning models such as ensemble models, deep neural networks, and Support Vector Machines have consistently shown remarkable accuracy in solving complex tasks. Although accurate, AI models often are "black boxes" which we are not able to understand. Relying on these models has a multifaceted impact and raises significant concerns about their transparency. Applications in sensitive and critical domains are a strong motivational factor in trying to understand the behavior of black boxes. We propose to address this issue by providing an interpretable layer on top of black box models by aggregating "local" explanations. We present GLocalX, a "local-first" model agnostic explanation method.…

Code & Models

Repositories

msetzu/glocalx
tf