Efficient Long-Short Temporal Attention Network for Unsupervised Video Object Segmentation
Ping Li, Yu Zhang, Li Yuan, Huaxin Xiao, Binbin Lin and, Xianghua Xu

TL;DR
This paper introduces an efficient Long-Short Temporal Attention network (LSTA) for unsupervised video object segmentation, effectively capturing spatial-temporal context and enabling real-time processing.
Contribution
The paper proposes a novel LSTA model with long-term memory and short-term attention modules, achieving high efficiency and promising results in unsupervised VOS.
Findings
High efficiency with nearly linear time complexity
Effective modeling of appearance and motion patterns
Promising performance on multiple benchmarks
Abstract
Unsupervised Video Object Segmentation (VOS) aims at identifying the contours of primary foreground objects in videos without any prior knowledge. However, previous methods do not fully use spatial-temporal context and fail to tackle this challenging task in real-time. This motivates us to develop an efficient Long-Short Temporal Attention network (termed LSTA) for unsupervised VOS task from a holistic view. Specifically, LSTA consists of two dominant modules, i.e., Long Temporal Memory and Short Temporal Attention. The former captures the long-term global pixel relations of the past frames and the current frame, which models constantly present objects by encoding appearance pattern. Meanwhile, the latter reveals the short-term local pixel relations of one nearby frame and the current frame, which models moving objects by encoding motion pattern. To speedup the inference, the efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Video Surveillance and Tracking Methods · Image Enhancement Techniques
Methodsfail · VOS
