Part-level Car Parsing and Reconstruction from Single Street View

Qichuan Geng; Hong Zhang; Xinyu Huang; Sen Wang; Feixiang; Lu; Xinjing Cheng; Zhong Zhou; Ruigang Yang

arXiv:1811.10837·cs.CV·August 27, 2019·5 cites

Part-level Car Parsing and Reconstruction from Single Street View

Qichuan Geng, Hong Zhang, Xinyu Huang, Sen Wang, Feixiang, Lu, Xinjing Cheng, Zhong Zhou, Ruigang Yang

PDF

Open Access

TL;DR

This paper introduces a novel framework for 3D car shape, pose, and part estimation from single street view images, leveraging synthesized data and implicit feature transfer, achieving state-of-the-art results.

Contribution

It presents the first unified approach for 3D car reconstruction and pose estimation from street views using part features and a new high-quality dataset.

Findings

01

Significant improvement in part segmentation accuracy after implicit transfer.

02

State-of-the-art performance on ApolloCar3D dataset.

03

Outperforms existing methods by over 8 percentage points in mean A3DP-Abs.

Abstract

Part information has been shown to be resistant to occlusions and viewpoint changes, which is beneficial for various vision-related tasks. However, we found very limited work in car pose estimation and reconstruction from street views leveraging the part information. There are two major contributions in this paper. Firstly, we make the first attempt to build a framework to simultaneously estimate shape, translation, orientation, and semantic parts of cars in 3D space from a single street view. As it is labor-intensive to annotate semantic parts on real street views, we propose a specific approach to implicitly transfer part features from synthesized images to real street views. For pose and shape estimation, we propose a novel network structure that utilizes both part features and 3D losses. Secondly, we are the first to construct a high-quality dataset that contains 348 different car…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Advanced Neural Network Applications · Human Pose and Action Recognition