SMOOT: Saliency Guided Mask Optimized Online Training

Ali Karkehabadi; Houman Homayoun; Avesta Sasan

arXiv:2310.00772·cs.CV·October 12, 2023·1 cites

TL;DR

SMOOT introduces a saliency-guided mask optimization technique that dynamically determines the optimal masking level during training, enhancing model accuracy and interpretability in deep neural networks.

Contribution

The paper presents a novel method to adaptively select the number of masked inputs based on training metrics, improving saliency relevance and model performance.

Findings

01 Improved model accuracy with saliency-guided masking.

02 Enhanced interpretability of neural networks through better saliency maps.

03 Demonstrated effectiveness on image classification tasks.

Abstract

Deep Neural Networks are powerful tools for understanding complex patterns and making decisions. However, their black-box nature impedes a complete understanding of their inner workings. Saliency-Guided Training (SGT) methods alleviate this problem by highlighting, during training, the input features that are most prominent with respect to the model's output. These methods use back-propagation and modified gradients to guide the model toward the most relevant features while keeping the impact on prediction accuracy negligible. SGT makes the model's final result more interpretable by partially masking the input, so that the model's output reveals how each segment of the input affects the prediction. When the input is an image, masking is applied to the input pixels. However, the masking strategy and the number of pixels to mask are treated as hyperparameters.…
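The masking step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a toy linear score in place of a deep network, gradient magnitude as the saliency measure, and masking of the least salient features, which is a common choice in saliency-guided training but is not stated in the abstract. The function names, the `fill` value, and the fixed `k` are hypothetical; how SMOOT selects the number of masked inputs is not shown here.

```python
def saliency_linear(weights):
    """Saliency of each input feature for a linear score w . x.

    Assumption: saliency is the gradient magnitude. For a linear score,
    d(score)/d(x_i) = w_i, so the saliency of feature i is |w_i|.
    """
    return [abs(w) for w in weights]


def mask_lowest_saliency(x, saliency, k, fill=0.0):
    """Return a copy of x with the k least salient features set to fill."""
    if not 0 <= k <= len(x):
        raise ValueError("k must be between 0 and len(x)")
    # Feature indices ordered from least to most salient.
    order = sorted(range(len(x)), key=lambda i: saliency[i])
    masked = list(x)
    for i in order[:k]:
        masked[i] = fill
    return masked


# Toy example: four "pixels" and a fixed masking budget k = 2.
weights = [0.1, -2.0, 0.05, 1.5]
x = [1.0, 1.0, 1.0, 1.0]
masked = mask_lowest_saliency(x, saliency_linear(weights), k=2)
print(masked)  # [0.0, 1.0, 0.0, 1.0]
```

Here `k` plays the role of the hyperparameter the abstract refers to: it fixes how many inputs are masked on every step.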

Taxonomy

Topics: Visual Attention and Saliency Detection · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques

Methods: Focus