Learning Precise, Contact-Rich Manipulation through Uncalibrated Tactile Skins
Venkatesh Pattabiraman, Yifeng Cao, Siddhant Haldar, Lerrel Pinto,, Raunaq Bhirangi

TL;DR
This paper introduces ViSk, a transformer-based framework that integrates magnetic tactile sensors with visual data to improve precise, contact-rich robotic manipulation, outperforming vision-only and optical tactile methods across multiple real-world tasks.
Contribution
The work presents a novel low-dimensional magnetic skin sensor and a transformer-based policy framework that effectively combines tactile and visual data for manipulation tasks.
Findings
ViSk outperforms vision-only policies.
Combining tactile and visual data improves performance.
27.5% average improvement across tasks.
Abstract
While visuomotor policy learning has advanced robotic manipulation, precisely executing contact-rich tasks remains challenging due to the limitations of vision in reasoning about physical interactions. To address this, recent work has sought to integrate tactile sensing into policy learning. However, many existing approaches rely on optical tactile sensors that are either restricted to recognition tasks or require complex dimensionality reduction steps for policy learning. In this work, we explore learning policies with magnetic skin sensors, which are inherently low-dimensional, highly sensitive, and inexpensive to integrate with robotic platforms. To leverage these sensors effectively, we present the Visuo-Skin (ViSk) framework, a simple approach that uses a transformer-based policy and treats skin sensor data as additional tokens alongside visual information. Evaluated on four…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTactile and Sensory Interactions
