Streamlining models with explanations in the learning loop

Francesco Lomuscio; Paolo Bajardi; Alan Perotti; and Elvio G. Amparore

arXiv:2302.07760·cs.LG·February 16, 2023

Streamlining models with explanations in the learning loop

Francesco Lomuscio, Paolo Bajardi, Alan Perotti, and Elvio G. Amparore

PDF

1 Repo

TL;DR

This paper introduces methods to incorporate local explanation insights into the training process, creating streamlined models that better align with explanation relevance while maintaining accuracy.

Contribution

It proposes two novel strategies, Iterative Dataset Weighting and Targeted Replacement Values, to integrate explanations into feature engineering for improved model interpretability.

Findings

01

Streamlined models achieve comparable accuracy to original black-box classifiers.

02

Explanations become more compact and relevant in the new models.

03

The methods enhance the interpretability of machine learning models.

Abstract

Several explainable AI methods allow a Machine Learning user to get insights on the classification process of a black-box model in the form of local linear explanations. With such information, the user can judge which features are locally relevant for the classification outcome, and get an understanding of how the model reasons. Standard supervised learning processes are purely driven by the original features and target labels, without any feedback loop informed by the local relevance of the features identified by the post-hoc explanations. In this paper, we exploit this newly obtained information to design a feature engineering phase, where we combine explanations with feature values. To do so, we develop two different strategies, named Iterative Dataset Weighting and Targeted Replacement Values, which generate streamlined models that better mimic the explanation process presented to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

francescolomuscio/explanations-in-the-loop
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.