Towards a More Reliable Interpretation of Machine Learning Outputs for Safety-Critical Systems using Feature Importance Fusion
Divish Rengasamy, Benjamin Rothwell, Grazziela Figueredo

TL;DR
This paper proposes a framework that combines multiple feature importance measures to improve the reliability of explanations in machine learning models used in safety-critical systems, demonstrating reduced error and robustness to noise.
Contribution
It introduces an extensible ensemble framework for feature importance fusion, including a novel fusion metric, and validates its effectiveness on synthetic data with known ground truth.
Findings
Ensemble approach reduces feature importance error by 15%
Feature importance ensembles are robust to dataset noise
Error increases with more features and orthogonal informative features
Abstract
When machine learning supports decision-making in safety-critical systems, it is important to verify and understand the reasons why a particular output is produced. Although feature importance calculation approaches assist in interpretation, there is a lack of consensus regarding how features' importance is quantified, which makes the explanations offered for the outcomes mostly unreliable. A possible solution to address the lack of agreement is to combine the results from multiple feature importance quantifiers to reduce the variance of estimates. Our hypothesis is that this will lead to more robust and trustworthy interpretations of the contribution of each feature to machine learning predictions. To assist test this hypothesis, we propose an extensible Framework divided in four main parts: (i) traditional data pre-processing and preparation for predictive machine learning models;…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Safety Analysis · Adversarial Robustness in Machine Learning · Fault Detection and Control Systems
