# Segmenting the Future

**Authors:** Hsu-kuang Chiu, Ehsan Adeli, Juan Carlos Niebles

arXiv: 1904.10666 · 2019-12-13

## TL;DR

This paper introduces a novel end-to-end model that predicts future semantic segmentation from past RGB video frames, enhancing scene understanding for autonomous systems.

## Contribution

It presents a temporal encoder-decoder architecture combined with a new knowledge distillation framework for future semantic segmentation prediction.

## Key findings

- Outperforms baseline methods on Cityscapes and Apolloscape datasets.
- Implicitly models scene segments and object dynamics from preceding frames.
- Achieves state-of-the-art accuracy in future scene segmentation prediction.

## Abstract

Predicting the future is an important aspect for decision-making in robotics or autonomous driving systems, which heavily rely upon visual scene understanding. While prior work attempts to predict future video pixels, anticipate activities or forecast future scene semantic segments from segmentation of the preceding frames, methods that predict future semantic segmentation solely from the previous frame RGB data in a single end-to-end trainable model do not exist. In this paper, we propose a temporal encoder-decoder network architecture that encodes RGB frames from the past and decodes the future semantic segmentation. The network is coupled with a new knowledge distillation training framework specific for the forecasting task. Our method, only seeing preceding video frames, implicitly models the scene segments while simultaneously accounting for the object dynamics to infer the future scene semantic segments. Our results on Cityscapes and Apolloscape outperform the baseline and current state-of-the-art methods. Code is available at https://github.com/eddyhkchiu/segmenting_the_future/.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.10666/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/1904.10666/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/1904.10666/full.md

---
Source: https://tomesphere.com/paper/1904.10666