Monocular Outdoor Semantic Mapping with a Multi-task Network
Yucai Bai, Lei Fan, Ziyu Pan, Long Chen

TL;DR
This paper introduces a multi-task CNN for monocular outdoor semantic mapping, enabling large-scale 3D reconstruction by jointly predicting depth and semantics from a single image stream, with post-processing for dense maps.
Contribution
It proposes a novel multi-task CNN with ASPP and residual connections for joint depth and semantic prediction from monocular images, improving outdoor semantic mapping.
Findings
Outperforms existing methods in semantic labeling and depth prediction
Produces dense, large-scale 3D semantic maps from monocular sequences
Demonstrates effectiveness on challenging outdoor datasets
Abstract
In many robotic applications, especially for the autonomous driving, understanding the semantic information and the geometric structure of surroundings are both essential. Semantic 3D maps, as a carrier of the environmental knowledge, are then intensively studied for their abilities and applications. However, it is still challenging to produce a dense outdoor semantic map from a monocular image stream. Motivated by this target, in this paper, we propose a method for large-scale 3D reconstruction from consecutive monocular images. First, with the correlation of underlying information between depth and semantic prediction, a novel multi-task Convolutional Neural Network (CNN) is designed for joint prediction. Given a single image, the network learns low-level information with a shared encoder and separately predicts with decoders containing additional Atrous Spatial Pyramid Pooling (ASPP)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques
MethodsSpatial Pyramid Pooling · Residual Connection
