ScoresActivation: A New Activation Function for Model Agnostic Global Explainability by Design
Emanuel Covaci, Fabian Galis, Radu Balan, Daniela Zaharie, Darian Onchis

TL;DR
ScoresActivation introduces a differentiable feature-ranking mechanism integrated into model training, enabling globally faithful, stable, and fast feature importance explanations while maintaining high predictive accuracy.
Contribution
The paper presents ScoresActivation, a novel differentiable activation function that embeds feature importance estimation directly into the training process for global explainability.
Findings
Feature scoring is 150 times faster than SHAP.
Improves classification accuracy significantly with relevant features.
Produces globally faithful and stable feature rankings.
Abstract
Understanding the decision of large deep learning models is a critical challenge for building transparent and trustworthy systems. Although the current post hoc explanation methods offer valuable insights into feature importance, they are inherently disconnected from the model training process, limiting their faithfulness and utility. In this work, we introduce a novel differentiable approach to global explainability by design, integrating feature importance estimation directly into model training. Central to our method is the ScoresActivation function, a feature-ranking mechanism embedded within the learning pipeline. This integration enables models to prioritize features according to their contribution to predictive performance in a differentiable and end-to-end trainable manner. Evaluations across benchmark datasets show that our approach yields globally faithful, stable feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Graph Neural Networks
