Data Augmentation via Latent Diffusion for Saliency Prediction
Bahar Aydemir, Deblina Bhattacharjee, Tong Zhang, Mathieu Salzmann,, Sabine S\"usstrunk

TL;DR
This paper introduces a novel data augmentation technique using latent diffusion and saliency-guided cross-attention to improve deep saliency prediction models by editing images while preserving scene complexity.
Contribution
It presents a new augmentation method that edits images based on photometric and semantic features, enhancing model performance and aligning predictions with human attention.
Findings
Consistent performance improvement across various models
Superior results on public saliency benchmarks
Predictions closely match human visual attention
Abstract
Saliency prediction models are constrained by the limited diversity and quantity of labeled data. Standard data augmentation techniques such as rotating and cropping alter scene composition, affecting saliency. We propose a novel data augmentation method for deep saliency prediction that edits natural images while preserving the complexity and variability of real-world scenes. Since saliency depends on high-level and low-level features, our approach involves learning both by incorporating photometric and semantic attributes such as color, contrast, brightness, and class. To that end, we introduce a saliency-guided cross-attention mechanism that enables targeted edits on the photometric properties, thereby enhancing saliency within specific image regions. Experimental results show that our data augmentation method consistently improves the performance of various saliency models.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Visual Attention and Saliency Detection · Diverse Topics in Contemporary Research
MethodsSoftmax · Attention Is All You Need · ALIGN
