Planning Visual-Tactile Precision Grasps via Complementary Use of Vision and Touch
Martin Matak, Tucker Hermans

TL;DR
This paper presents a novel approach for planning and executing visual-tactile precision grasps on unseen objects by combining visual surface estimation with tactile feedback, improving grasp success without requiring explicit object models.
Contribution
The method explicitly reasons about fingertip contact points using visual estimates and tactile feedback, enabling adaptive grasping without detailed object models or integrated surface estimates.
Findings
Successfully grasped unseen objects using a single camera view
Outperformed state-of-the-art multi-fingered grasp planners
Achieved higher grasp success rates with tactile feedback integration
Abstract
Reliably planning fingertip grasps for multi-fingered hands lies as a key challenge for many tasks including tool use, insertion, and dexterous in-hand manipulation. This task becomes even more difficult when the robot lacks an accurate model of the object to be grasped. Tactile sensing offers a promising approach to account for uncertainties in object shape. However, current robotic hands tend to lack full tactile coverage. As such, a problem arises of how to plan and execute grasps for multi-fingered hands such that contact is made with the area covered by the tactile sensors. To address this issue, we propose an approach to grasp planning that explicitly reasons about where the fingertips should contact the estimated object surface while maximizing the probability of grasp success. Key to our method's success is the use of visual surface estimation for initial planning to encode the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Tactile and Sensory Interactions · Mobile Crowdsensing and Crowdsourcing
