PERCH 2.0 : Fast and Accurate GPU-based Perception via Search for Object Pose Estimation
Aditya Agarwal, Yupeng Han, Maxim Likhachev

TL;DR
PERCH 2.0 introduces a GPU-accelerated, RGB-inclusive search-based method for fast, accurate 6-DoF object pose estimation that outperforms previous approaches without requiring extensive training data.
Contribution
It presents PERCH 2.0, a novel perception via search strategy leveraging GPU and RGB data to significantly improve speed and accuracy in object pose estimation.
Findings
Achieves 100x speedup over original PERCH
Outperforms state-of-the-art data-driven methods in accuracy
Operates without needing annotated ground truth poses
Abstract
Pose estimation of known objects is fundamental to tasks such as robotic grasping and manipulation. The need for reliable grasping imposes stringent accuracy requirements on pose estimation in cluttered, occluded scenes in dynamic environments. Modern methods employ large sets of training data to learn features in order to find correspondence between 3D models and observed data. However these methods require extensive annotation of ground truth poses. An alternative is to use algorithms that search for the best explanation of the observed scene in a space of possible rendered scenes. A recently developed algorithm, PERCH (PErception Via SeaRCH) does so by using depth data to converge to a globally optimum solution using a search over a specially constructed tree. While PERCH offers strong guarantees on accuracy, the current formulation suffers from low scalability owing to its high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
