Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation

Bowen Wang; Liangzhi Li; Yuta Nakashima; Ryo Kawasaki; Hajime Nagahara; Yasushi Yagi

arXiv:2010.09466·cs.CV·October 20, 2020

Open Access

TL;DR

Noisy-LSTM introduces a training strategy that improves temporal awareness in video semantic segmentation by replacing a frame in the input sequence with noise during training, reaching state-of-the-art results without extra data annotation or computational cost.

Contribution

The paper proposes Noisy-LSTM, a new end-to-end trainable model with a noise-based training strategy to improve temporal feature extraction in video segmentation.

Findings

01  Achieves state-of-the-art performance on the CityScapes and EndoVis2018 datasets.

02  Serves as a regularizer against overfitting without additional data annotation or computational cost.

03  Improves how the model handles temporal coherence in video segmentation.

Abstract

Semantic video segmentation is a key challenge for various applications. This paper presents a new model named Noisy-LSTM, which is trainable in an end-to-end manner and uses convolutional LSTMs (ConvLSTMs) to leverage the temporal coherency in video frames. We also present a simple yet effective training strategy, which replaces a frame in a video sequence with noise. This strategy spoils the temporal coherency in video frames during training and thus makes the temporal links in ConvLSTMs unreliable, which may consequently improve feature extraction from video frames, as well as serve as a regularizer to avoid overfitting, without requiring extra data annotation or computational costs. Experimental results demonstrate that the proposed model can achieve state-of-the-art performance on both the CityScapes and EndoVis2018 datasets.
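
The frame-replacement idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `replace_frame_with_noise`, the replacement probability `p`, the choice of uniform noise, and the convention that the last frame of the clip is the one being segmented (and is therefore never replaced) are all assumptions made for illustration.

```python
import numpy as np

def replace_frame_with_noise(frames, rng, p=0.5):
    """Return a copy of `frames` with at most one frame replaced by noise.

    frames: float array of shape (T, H, W, C), values assumed to lie in [0, 1].
    rng:    a numpy.random.Generator.
    p:      hypothetical probability of corrupting the clip at all.

    Assumption: the last frame is the target frame, so only earlier
    frames are candidates for replacement.
    """
    frames = frames.copy()  # leave the caller's clip untouched
    num_frames = frames.shape[0]
    if num_frames < 2 or rng.random() >= p:
        return frames, None  # clip left unchanged
    # Pick one index among the non-final frames: 0 .. T-2.
    idx = int(rng.integers(0, num_frames - 1))
    # Assumption: uniform noise in [0, 1]; the source does not fix the distribution here.
    frames[idx] = rng.uniform(0.0, 1.0, size=frames.shape[1:])
    return frames, idx

rng = np.random.default_rng(0)
clip = np.zeros((4, 8, 8, 3), dtype=np.float32)  # toy 4-frame clip
noisy_clip, replaced = replace_frame_with_noise(clip, rng, p=1.0)
```

In a training loop, a function like this would be applied to each clip before it is passed to the ConvLSTM-based model, so that the recurrent links sometimes carry uninformative input. No extra annotation is needed because the labels of the target frame are unchanged.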

Taxonomy

Topics: Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods