Part-level Car Parsing and Reconstruction from Single Street View
Qichuan Geng, Hong Zhang, Xinyu Huang, Sen Wang, Feixiang, Lu, Xinjing Cheng, Zhong Zhou, Ruigang Yang

TL;DR
This paper introduces a novel framework for 3D car shape, pose, and part estimation from single street view images, leveraging synthesized data and implicit feature transfer, achieving state-of-the-art results.
Contribution
It presents the first unified approach for 3D car reconstruction and pose estimation from street views using part features and a new high-quality dataset.
Findings
Significant improvement in part segmentation accuracy after implicit transfer.
State-of-the-art performance on ApolloCar3D dataset.
Outperforms existing methods by over 8 percentage points in mean A3DP-Abs.
Abstract
Part information has been shown to be resistant to occlusions and viewpoint changes, which is beneficial for various vision-related tasks. However, we found very limited work in car pose estimation and reconstruction from street views leveraging the part information. There are two major contributions in this paper. Firstly, we make the first attempt to build a framework to simultaneously estimate shape, translation, orientation, and semantic parts of cars in 3D space from a single street view. As it is labor-intensive to annotate semantic parts on real street views, we propose a specific approach to implicitly transfer part features from synthesized images to real street views. For pose and shape estimation, we propose a novel network structure that utilizes both part features and 3D losses. Secondly, we are the first to construct a high-quality dataset that contains 348 different car…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Advanced Neural Network Applications · Human Pose and Action Recognition
