SG-FCN: A Motion and Memory-Based Deep Learning Model for Video Saliency Detection
Meijun Sun, Ziqi Zhou, QinGhua Hu, Zheng Wang, Jianmin Jiang

TL;DR
This paper introduces SG-FCN, a deep learning model that integrates motion and memory mechanisms to enhance video saliency detection, outperforming existing methods across multiple datasets.
Contribution
The paper presents a novel fully convolutional network that simulates human memory and attention to improve video fixation detection performance.
Findings
Outperforms 11 state-of-the-art methods on multiple datasets
Effectively integrates motion and temporal information
Demonstrates superior accuracy in complex scenes
Abstract
Data-driven saliency detection has attracted strong interest as a result of applying convolutional neural networks to the detection of eye fixations. Although a number of imagebased salient object and fixation detection models have been proposed, video fixation detection still requires more exploration. Different from image analysis, motion and temporal information is a crucial factor affecting human attention when viewing video sequences. Although existing models based on local contrast and low-level features have been extensively researched, they failed to simultaneously consider interframe motion and temporal information across neighboring video frames, leading to unsatisfactory performance when handling complex scenes. To this end, we propose a novel and efficient video eye fixation detection model to improve the saliency detection performance. By simulating the memory mechanism and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
