Modeling Saliency Dataset Bias
Matthias K\"ummerer, Harneet Singh Khanuja, Matthias Bethge

TL;DR
This paper investigates dataset bias in image saliency prediction, revealing significant generalization challenges across datasets and proposing a novel, minimally dataset-specific model that improves state-of-the-art performance and offers insights into saliency properties.
Contribution
It introduces a new architecture with minimal dataset-specific parameters that effectively addresses dataset bias and enhances generalization in saliency prediction.
Findings
Significant performance drop (~40%) when applying models across different datasets.
Increasing dataset diversity alone does not eliminate dataset bias.
The proposed model improves state-of-the-art results and generalizes well even with limited adaptation samples.
Abstract
Recent advances in image-based saliency prediction are approaching gold standard performance levels on existing benchmarks. Despite this success, we show that predicting fixations across multiple saliency datasets remains challenging due to dataset bias. We find a significant performance drop (around 40%) when models trained on one dataset are applied to another. Surprisingly, increasing dataset diversity does not resolve this inter-dataset gap, with close to 60% attributed to dataset-specific biases. To address this remaining generalization gap, we propose a novel architecture extending a mostly dataset-agnostic encoder-decoder structure with fewer than 20 dataset-specific parameters that govern interpretable mechanisms such as multi-scale structure, center bias, and fixation spread. Adapting only these parameters to new data accounts for more than 75% of the generalization gap, with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection
