The Stixel world: A medium-level representation of traffic scenes
Marius Cordts, Timo Rehfeld, Lukas Schneider, David Pfeiffer, Markus, Enzweiler, Stefan Roth, Marc Pollefeys, Uwe Franke

TL;DR
The paper introduces the Stixel World, a medium-level, structured representation of traffic scenes that compresses obstacle information for efficient automotive vision applications, and extends it to multiple sensor inputs.
Contribution
It generalizes the Stixel model to incorporate multiple dense input streams and presents a new mathematical formulation and learning approach using structured SVMs.
Findings
Proven usefulness in object detection, tracking, segmentation, and mapping.
Extended model to handle stereo depth and semantic class maps.
New structured SVM-based parameter learning method.
Abstract
Recent progress in advanced driver assistance systems and the race towards autonomous vehicles is mainly driven by two factors: (1) increasingly sophisticated algorithms that interpret the environment around the vehicle and react accordingly, and (2) the continuous improvements of sensor technology itself. In terms of cameras, these improvements typically include higher spatial resolution, which as a consequence requires more data to be processed. The trend to add multiple cameras to cover the entire surrounding of the vehicle is not conducive in that matter. At the same time, an increasing number of special purpose algorithms need access to the sensor input data to correctly interpret the various complex situations that can occur, particularly in urban traffic. By observing those trends, it becomes clear that a key challenge for vision architectures in intelligent vehicles is to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
See pages 1-last of Cordts_StixelWorld_IVC.pdf
