Memory-Efficient Continual Learning Object Segmentation for Long Video
Amir Nazemi, Mohammad Javad Shafiee, Zahra Gharaee, Paul Fieguth

TL;DR
This paper introduces two novel memory-efficient continual learning techniques, GRCL and RMSCL, that enhance online video object segmentation performance on long videos while reducing memory usage.
Contribution
The paper proposes two new methods, GRCL and RMSCL, to improve memory efficiency and accuracy of online VOS in long videos, addressing memory limitations and representation drift.
Findings
Improved VOS performance by over 8% on long videos.
Enhanced robustness on long-video datasets.
Maintained performance on short-video datasets.
Abstract
Recent state-of-the-art semi-supervised Video Object Segmentation (VOS) methods have shown significant improvements in target object segmentation accuracy when information from preceding frames is used in segmenting the current frame. In particular, such memory-based approaches can help a model to more effectively handle appearance changes (representation drift) or occlusions. Ideally, for maximum performance, Online VOS methods would need all or most of the preceding frames (or their extracted information) to be stored in memory and be used for online learning in later frames. Such a solution is not feasible for long videos, as the required memory size grows without bound, and such methods can fail when memory is limited and a target object experiences repeated representation drifts throughout a video. We propose two novel techniques to reduce the memory requirement of Online VOS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Advanced Neural Network Applications
MethodsVOS · fail
