Saliency for Fine-grained Object Recognition in Domains with Scarce Training Data
Carola Figueroa Flores, Abel Gonzalez-Garc\'ia, Joost van de Weijer, and Bogdan Raducanu

TL;DR
This paper demonstrates that incorporating a saliency branch into CNNs significantly enhances fine-grained object recognition accuracy, especially with limited training data, by acting as an attentional mechanism during feature extraction.
Contribution
It introduces a novel saliency-guided CNN architecture that improves recognition performance in data-scarce scenarios, validated through extensive experiments on multiple fine-grained datasets.
Findings
Saliency methods improve recognition accuracy with scarce data.
Saliency maps correlate with better recognition performance.
The proposed pipeline effectively evaluates saliency in high-level vision tasks.
Abstract
This paper investigates the role of saliency to improve the classification accuracy of a Convolutional Neural Network (CNN) for the case when scarce training data is available. Our approach consists in adding a saliency branch to an existing CNN architecture which is used to modulate the standard bottom-up visual features from the original image input, acting as an attentional mechanism that guides the feature extraction process. The main aim of the proposed approach is to enable the effective training of a fine-grained recognition model with limited training samples and to improve the performance on the task, thereby alleviating the need to annotate large dataset. % The vast majority of saliency methods are evaluated on their ability to generate saliency maps, and not on their functionality in a complete vision pipeline. Our proposed pipeline allows to evaluate saliency methods for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
