DR-WLC: Dimensionality Reduction cognition for object detection and pose   estimation by Watching, Learning and Checking

Yu Gao; Xi Xu; Tianji Jiang; Siyuan Chen; Yi Yang; Yufeng Yue; Mengyin; Fu

arXiv:2301.06944·cs.CV·January 18, 2023

DR-WLC: Dimensionality Reduction cognition for object detection and pose estimation by Watching, Learning and Checking

Yu Gao, Xi Xu, Tianji Jiang, Siyuan Chen, Yi Yang, Yufeng Yue, Mengyin, Fu

PDF

Open Access

TL;DR

This paper introduces DR-WLC, a dimensionality reduction cognitive model that enables simultaneous object detection and pose estimation using only 3D models and unlabeled images, reducing annotation costs.

Contribution

The proposed model uniquely combines object detection and pose estimation with minimal supervision, requiring only 3D models and unlabeled images, simplifying deployment.

Findings

01

Achieves object detection and pose estimation without manual annotations

02

Uses only 3D models and unlabeled images for training

03

Easily deployable for practical applications

Abstract

Object detection and pose estimation are difficult tasks in robotics and autonomous driving. Existing object detection and pose estimation methods mostly adopt the same-dimensional data for training. For example, 2D object detection usually requires a large amount of 2D annotation data with high cost. Using high-dimensional information to supervise lower-dimensional tasks is a feasible way to reduce datasets size. In this work, the DR-WLC, a dimensionality reduction cognitive model, which can perform both object detection and pose estimation tasks at the same time is proposed. The model only requires 3D model of objects and unlabeled environment images (with or without objects) to finish the training. In addition, a bounding boxes generation strategy is also proposed to build the relationship between 3D model and 2D object detection task. Experiments show that our method can qualify the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques