Pre or Post-Softmax Scores in Gradient-based Attribution Methods, What   is Best?

Miguel Lerma; Mirtha Lucas

arXiv:2306.13197·cs.LG·July 20, 2023

Pre or Post-Softmax Scores in Gradient-based Attribution Methods, What is Best?

Miguel Lerma, Mirtha Lucas

PDF

Open Access 1 Repo

TL;DR

This paper compares the effectiveness of using pre-softmax versus post-softmax scores in gradient-based attribution methods for neural network classifiers, highlighting practical differences and implications.

Contribution

It provides a detailed analysis of the advantages and disadvantages of pre-softmax and post-softmax gradients in attribution methods.

Findings

01

Pre-softmax gradients often provide clearer attribution signals.

02

Post-softmax gradients can be more stable but less informative.

03

The choice impacts the interpretability of attribution results.

Abstract

Gradient based attribution methods for neural networks working as classifiers use gradients of network scores. Here we discuss the practical differences between using gradients of pre-softmax scores versus post-softmax scores, and their respective advantages and disadvantages.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mlerma54/adversarial-attacks-on-saliency-maps
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Stochastic Gradient Optimization Techniques · Machine Learning and ELM