Smart Explorer: Recognizing Objects in Dense Clutter via Interactive Exploration
Zhenyu Wu, Ziwei Wang, Zibu Wei, Yi Wei, Haibin Yan

TL;DR
Smart Explorer is an interactive framework that improves object recognition in dense clutter by physically manipulating the environment to reduce occlusion and ambiguity, achieving high accuracy with minimal actions.
Contribution
It introduces an interactive exploration method combining multi-view RGB-D data and physical actions to enhance recognition in cluttered scenes, outperforming non-interactive approaches.
Findings
Achieves high recognition accuracy with few actions.
Outperforms random pushing in dense clutter recognition.
Effectively reduces occlusion and ambiguity through physical interaction.
Abstract
Recognizing objects in dense clutter accurately plays an important role to a wide variety of robotic manipulation tasks including grasping, packing, rearranging and many others. However, conventional visual recognition models usually miss objects because of the significant occlusion among instances and causes incorrect prediction due to the visual ambiguity with the high object crowdedness. In this paper, we propose an interactive exploration framework called Smart Explorer for recognizing all objects in dense clutters. Our Smart Explorer physically interacts with the clutter to maximize the recognition performance while minimize the number of motions, where the false positives and negatives can be alleviated effectively with the optimal accuracy-efficiency trade-offs. Specifically, we first collect the multi-view RGB-D images of the clutter and reconstruct the corresponding point…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Robot Manipulation and Learning · Advanced Image and Video Retrieval Techniques
