SAGE: Saliency-Guided Contrastive Embeddings
Colton R. Crum, Christopher Sweet, Adam Czajka

TL;DR
SAGE introduces a novel contrastive loss that leverages human saliency in the model's latent space, improving neural network generalization and robustness by guiding focus towards salient features during training.
Contribution
The paper proposes SAGE, a new loss function that uses human saliency and contrastive embeddings in latent space, moving beyond image space guidance for better model training.
Findings
SAGE improves classification accuracy over state-of-the-art saliency methods.
It demonstrates robustness across different neural network architectures.
Effective in both open- and closed-set scenarios.
Abstract
Integrating human perceptual priors into the training of neural networks has been shown to raise model generalization, serve as an effective regularizer, and align models with human expertise for applications in high-risk domains. Existing approaches to integrate saliency into model training often rely on internal model mechanisms, which recent research suggests may be unreliable. Our insight is that many challenges associated with saliency-guided training stem from the placement of the guidance approaches solely within the image space. Instead, we move away from the image space, use the model's latent space embeddings to steer human guidance during training, and we propose SAGE (Saliency-Guided Contrastive Embeddings): a loss function that integrates human saliency into network training using contrastive embeddings. We apply salient-preserving and saliency-degrading signal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Visual Attention and Saliency Detection · Advanced Neural Network Applications
