General Geometry-aware Weakly Supervised 3D Object Detection
Guowen Zhang, Junsong Fan, Liyi Chen, Zhaoxiang Zhang, Zhen Lei, Lei, Zhang

TL;DR
This paper introduces a general framework for weakly supervised 3D object detection from RGB images, leveraging large language models for priors and geometric constraints to achieve high-quality results with minimal annotation.
Contribution
A unified approach that integrates language model priors and geometric constraints for adaptable 3D detection from 2D annotations.
Findings
Achieves high-quality 3D bounding boxes on KITTI and SUN-RGBD datasets.
Outperforms existing weakly supervised methods in accuracy.
Demonstrates robustness across different scenes and categories.
Abstract
3D object detection is an indispensable component for scene understanding. However, the annotation of large-scale 3D datasets requires significant human effort. To tackle this problem, many methods adopt weakly supervised 3D object detection that estimates 3D boxes by leveraging 2D boxes and scene/class-specific priors. However, these approaches generally depend on sophisticated manual priors, which is hard to generalize to novel categories and scenes. In this paper, we are motivated to propose a general approach, which can be easily adapted to new scenes and/or classes. A unified framework is developed for learning 3D object detectors from RGB images and associated 2D boxes. In specific, we propose three general components: prior injection module to obtain general object geometric priors from LLM model, 2D space projection constraint to minimize the discrepancy between the boundaries…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Object Detection Techniques · Medical Image Segmentation Techniques · Advanced Neural Network Applications
