Understanding Road Layout from Videos as a Whole
Buyu Liu, Bingbing Zhuang, Samuel Schulter, Pan Ji, Manmohan, Chandraker

TL;DR
This paper presents a novel approach for inferring complex road layouts from videos by integrating camera motion, context cues, and long-term information, significantly improving accuracy and consistency over previous methods.
Contribution
It introduces a new model combining LSTM and Feature Transform Module that leverages camera motion and context cues for consistent, accurate road layout prediction from videos.
Findings
Incorporating global and contextual cues enhances prediction accuracy.
LSTM and FTM modules improve prediction consistency.
Proposed method outperforms state-of-the-art techniques significantly.
Abstract
In this paper, we address the problem of inferring the layout of complex road scenes from video sequences. To this end, we formulate it as a top-view road attributes prediction problem and our goal is to predict these attributes for each frame both accurately and consistently. In contrast to prior work, we exploit the following three novel aspects: leveraging camera motions in videos, including context cuesand incorporating long-term video information. Specifically, we introduce a model that aims to enforce prediction consistency in videos. Our model consists of one LSTM and one Feature Transform Module (FTM). The former implicitly incorporates the consistency constraint with its hidden states, and the latter explicitly takes the camera motion into consideration when aggregating information along videos. Moreover, we propose to incorporate context information by introducing road…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Understanding Road Layout From Videos as a Whole· youtube
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Video Analysis and Summarization
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory
