TL;DR
This paper introduces a new fixation prediction framework that leverages inter-image similarities and an ensemble of Extreme Learning Machines trained on similar images, improving saliency prediction by incorporating scene context and memorability.
Contribution
It proposes a novel ensemble-based saliency modeling approach using deep features and inter-image similarity, enhancing fixation prediction accuracy.
Findings
Ensemble of ELMs trained on similar images improves saliency prediction.
Inter-image similarity effectively incorporates scene context and memorability.
The method outperforms existing fixation prediction models.
Abstract
This paper presents a novel fixation prediction and saliency modeling framework based on inter-image similarities and ensemble of Extreme Learning Machines (ELM). The proposed framework is inspired by two observations, 1) the contextual information of a scene along with low-level visual cues modulates attention, 2) the influence of scene memorability on eye movement patterns caused by the resemblance of a scene to a former visual experience. Motivated by such observations, we develop a framework that estimates the saliency of a given image using an ensemble of extreme learners, each trained on an image similar to the input image. That is, after retrieving a set of similar images for a given image, a saliency predictor is learnt from each of the images in the retrieved image set using an ELM, resulting in an ensemble. The saliency of the given image is then measured in terms of the mean…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
