Multi-Class Detection and Segmentation of Objects in Depth
Cheng Zhang, Hedvig Kjellstrom

TL;DR
This paper introduces a novel 3D multi-class object detection and segmentation method using RGB-D data, enhancing robustness and providing accurate 3D localization for humanoid robots in home environments.
Contribution
It presents a minimal joint codebook for multi-class detection and integrates depth information to improve robustness and 3D localization capabilities.
Findings
Improved detection efficiency for multiple classes
Enhanced robustness with depth information
Accurate 3D object localization
Abstract
The quality of life of many people could be improved by autonomous humanoid robots in the home. To function in the human world, a humanoid household robot must be able to locate itself and perceive the environment like a human; scene perception, object detection and segmentation, and object spatial localization in 3D are fundamental capabilities for such humanoid robots. This paper presents a 3D multi-class object detection and segmentation method. The contributions are twofold. Firstly, we present a multi-class detection method, where a minimal joint codebook is learned in a principled manner. Secondly, we incorporate depth information using RGB-D imagery, which increases the robustness of the method and gives the 3D location of objects -- necessary since the robot reasons in 3D space. Experiments show that the multi-class extension improves the detection efficiency with respect to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization · Image and Object Detection Techniques
