Improving Attribution Methods by Learning Submodular Functions
Piyushi Manupriya, Tarun Ram Menta, J. Saketha Nath, Vineeth N, Balasubramanian

TL;DR
This paper introduces a novel approach that learns a deep submodular scoring function to enhance feature attribution methods by reducing redundancy and increasing specificity, demonstrated through experiments on multiple datasets.
Contribution
It proposes a new formulation for learning deep submodular functions consistent with existing attribution maps, improving attribution specificity and discriminative power.
Findings
Achieves higher attribution specificity
Reduces redundancy in feature attribution
Maintains good discriminative power
Abstract
This work explores the novel idea of learning a submodular scoring function to improve the specificity/selectivity of existing feature attribution methods. Submodular scores are natural for attribution as they are known to accurately model the principle of diminishing returns. A new formulation for learning a deep submodular set function that is consistent with the real-valued attribution maps obtained by existing attribution methods is proposed. The final attribution value of a feature is then defined as the marginal gain in the induced submodular score of the feature in the context of other highly attributed features, thus decreasing the attribution of redundant yet discriminatory features. Experiments on multiple datasets illustrate that the proposed attribution method achieves higher specificity along with good discriminative power. The implementation of our method is publicly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Imbalanced Data Classification Techniques · Adversarial Robustness in Machine Learning
