PERCH: Perception via Search for Multi-Object Recognition and Localization
Venkatraman Narayanan, Maxim Likhachev

TL;DR
PERCH introduces a generative, search-based approach for multi-object recognition and localization that guarantees optimal scene explanation and performs well under occlusion, surpassing traditional discriminative methods.
Contribution
The paper presents a novel optimization framework for multi-object recognition, formulating it as a combinatorial search on a Monotone Scene Generation Tree, with guarantees of optimality.
Findings
Successfully localizes objects under heavy occlusion.
Outperforms state-of-the-art methods in challenging scenarios.
Guarantees the best scene explanation under the defined cost function.
Abstract
In many robotic domains such as flexible automated manufacturing or personal assistance, a fundamental perception task is that of identifying and localizing objects whose 3D models are known. Canonical approaches to this problem include discriminative methods that find correspondences between feature descriptors computed over the model and observed data. While these methods have been employed successfully, they can be unreliable when the feature descriptors fail to capture variations in observed data; a classic cause being occlusion. As a step towards deliberative reasoning, we present PERCH: PErception via SeaRCH, an algorithm that seeks to find the best explanation of the observed sensor data by hypothesizing possible scenes in a generative fashion. Our contributions are: i) formulating the multi-object recognition and localization task as an optimization problem over the space of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
