Future Semantic Segmentation with Convolutional LSTM

Seyed shahabeddin Nabavi; Mrigank Rochan; Yang; Wang

arXiv:1807.07946·cs.CV·July 23, 2018·31 cites

Future Semantic Segmentation with Convolutional LSTM

Seyed shahabeddin Nabavi, Mrigank Rochan, Yang, Wang

PDF

Open Access

TL;DR

This paper introduces a convolutional LSTM-based model for predicting future semantic segmentation in videos, enhancing real-time decision-making applications like autonomous driving by capturing spatiotemporal information.

Contribution

The paper presents a novel ConvLSTM model for future semantic segmentation and extends it with bidirectional ConvLSTM to improve temporal context understanding.

Findings

01

Outperforms state-of-the-art methods on benchmark datasets

02

Effectively captures spatiotemporal information for future frame prediction

03

Bidirectional ConvLSTM improves prediction accuracy

Abstract

We consider the problem of predicting semantic segmentation of future frames in a video. Given several observed frames in a video, our goal is to predict the semantic segmentation map of future frames that are not yet observed. A reliable solution to this problem is useful in many applications that require real-time decision making, such as autonomous driving. We propose a novel model that uses convolutional LSTM (ConvLSTM) to encode the spatiotemporal information of observed frames for future prediction. We also extend our model to use bidirectional ConvLSTM to capture temporal information in both directions. Our proposed approach outperforms other state-of-the-art methods on the benchmark dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Visual Attention and Saliency Detection · Domain Adaptation and Few-Shot Learning

MethodsConvolution · ConvLSTM · Sigmoid Activation · Tanh Activation · Long Short-Term Memory