Learning to Identify Object Instances by Touch: Tactile Recognition via Multimodal Matching
Justin Lin, Roberto Calandra, and Sergey Levine

TL;DR
This paper introduces a novel multimodal recognition system that uses tactile and visual data to identify objects, demonstrating high accuracy even with unseen objects and outperforming human volunteers.
Contribution
It is the first large-scale study addressing touch-based instance recognition using multimodal data, with a self-supervised data collection approach and a model that surpasses existing methods.
Findings
Accurately recognizes objects by touch alone.
Outperforms human volunteers in object recognition.
Effective on novel, unseen objects.
Abstract
Much of the literature on robotic perception focuses on the visual modality. Vision provides a global observation of a scene, making it broadly useful. However, in the domain of robotic manipulation, vision alone can sometimes prove inadequate: in the presence of occlusions or poor lighting, visual object identification might be difficult. The sense of touch can provide robots with an alternative mechanism for recognizing objects. In this paper, we study the problem of touch-based instance recognition. We propose a novel framing of the problem as multi-modal recognition: the goal of our system is to recognize, given a visual and tactile observation, whether or not these observations correspond to the same object. To our knowledge, our work is the first to address this type of multi-modal instance recognition problem on such a large-scale with our analysis spanning 98 different objects.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTactile and Sensory Interactions · Robot Manipulation and Learning · EEG and Brain-Computer Interfaces
