Simultaneous 3D Object Segmentation and 6-DOF Pose Estimation
Hongsen Liu

TL;DR
This paper introduces a single-shot 3D point cloud method that simultaneously segments objects and estimates their 6-DOF poses, utilizing augmented reality for training data generation and a multi-task CNN architecture.
Contribution
It presents a novel, concise approach that avoids 2D projections and spatial transformations, leveraging augmented reality for training data in 3D object detection and pose estimation.
Findings
Achieves comparable or superior performance to state-of-the-art methods.
Effectively generalizes across multiple datasets and scenarios.
Utilizes augmented reality to generate training data in semi-virtual 3D space.
Abstract
We propose a single-shot method for simultaneous 3D object segmentation and 6-DOF pose estimation in pure 3D point clouds scenes based on a consensus that \emph{one point only belongs to one object}, i.e., each point has the potential power to predict the 6-DOF pose of its corresponding object. Unlike the recently proposed methods of the similar task, which rely on 2D detectors to predict the projection of 3D corners of the 3D bounding boxes and the 6-DOF pose must be estimated by a PnP like spatial transformation method, ours is concise enough not to require additional spatial transformation between different dimensions. Due to the lack of training data for many objects, the recently proposed 2D detection methods try to generate training data by using rendering engine and achieve good results. However, rendering in 3D space along with 6-DOF is relatively difficult. Therefore, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Robot Manipulation and Learning
