Improving Annotation for 3D Pose Dataset of Fine-Grained Object Categories
Yaming Wang, Xiao Tan, Yi Yang, Ziyu Li, Xiao Liu, Feng Zhou, Larry S., Davis

TL;DR
This paper introduces a large-scale, fine-grained 3D pose dataset with 409 categories and nearly 32,000 images, enhancing the detail and accuracy of pose annotations for object recognition tasks.
Contribution
The work creates a new dataset by augmenting existing fine-grained recognition datasets with precise 3D pose annotations using a combination of manual adjustment and optimization techniques.
Findings
Dataset contains 409 categories and 31,881 images.
Annotations are validated with qualitative and quantitative analysis.
Provides a resource for advancing 3D pose estimation research.
Abstract
Existing 3D pose datasets of object categories are limited to generic object types and lack of fine-grained information. In this work, we introduce a new large-scale dataset that consists of 409 fine-grained categories and 31,881 images with accurate 3D pose annotation. Specifically, we augment three existing fine-grained object recognition datasets (StanfordCars, CompCars and FGVC-Aircraft) by finding a specific 3D model for each sub-category from ShapeNet and manually annotating each 2D image by adjusting a full set of 7 continuous perspective parameters. Since the fine-grained shapes allow 3D models to better fit the images, we further improve the annotation quality by initializing from the human annotation and conducting local search of the pose parameters with the objective of maximizing the IoUs between the projected mask and the segmentation reference estimated from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Image Processing and 3D Reconstruction
