Logic Constraints to Feature Importances
Nicola Picchiotti, Marco Gori

TL;DR
This paper introduces a method to incorporate human prior knowledge of feature importance into AI models through regularization, enhancing interpretability and trustworthiness, with promising results in fairness applications.
Contribution
It proposes a novel framework that extends empirical loss with a regularization term based on feature importance constraints, leveraging local interpretability methods.
Findings
Effective in improving fairness on the Adult dataset
Framework is model-agnostic and adaptable to various applications
Enhances interpretability and trustworthiness of AI models
Abstract
In recent years, Artificial Intelligence (AI) algorithms have been proven to outperform traditional statistical methods in terms of predictivity, especially when a large amount of data was available. Nevertheless, the "black box" nature of AI models is often a limit for a reliable application in high-stakes fields like diagnostic techniques, autonomous guide, etc. Recent works have shown that an adequate level of interpretability could enforce the more general concept of model trustworthiness. The basic idea of this paper is to exploit the human prior knowledge of the features' importance for a specific task, in order to coherently aid the phase of the model's fitting. This sort of "weighted" AI is obtained by extending the empirical loss with a regularization term encouraging the importance of the features to follow predetermined constraints. This procedure relies on local methods for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Machine Learning and Data Classification
MethodsLocal Interpretable Model-Agnostic Explanations
