Seeing by haptic glance: reinforcement learning-based 3D object Recognition
Kevin Riou, Suiyi Ling, Guillaume Gallot, Patrick Le Callet

TL;DR
This paper introduces a reinforcement learning framework enabling robots to efficiently recognize 3D objects through limited haptic interactions by actively selecting key points, mimicking human 'haptic glance' capabilities.
Contribution
It presents a novel RL-based method for active 3D recognition with sparse haptic data, optimizing exploration and recognition simultaneously.
Findings
Outperforms existing models in 3D recognition accuracy
Efficiently identifies key 3D points with limited interactions
Demonstrates effectiveness in realistic haptic exploration scenarios
Abstract
Human is able to conduct 3D recognition by a limited number of haptic contacts between the target object and his/her fingers without seeing the object. This capability is defined as `haptic glance' in cognitive neuroscience. Most of the existing 3D recognition models were developed based on dense 3D data. Nonetheless, in many real-life use cases, where robots are used to collect 3D data by haptic exploration, only a limited number of 3D points could be collected. In this study, we thus focus on solving the intractable problem of how to obtain cognitively representative 3D key-points of a target object with limited interactions between the robot and the object. A novel reinforcement learning based framework is proposed, where the haptic exploration procedure (the agent iteratively predicts the next position for the robot to explore) is optimized simultaneously with the objective 3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
