Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation
Yuyang Li, Yinghan Chen, Zihang Zhao, Puhao Li, Tengyu Liu, Siyuan Huang, Yixin Zhu

TL;DR
This paper presents TacThru, a novel sensor and learning framework that combines tactile and visual perception for improved robotic manipulation, achieving high success rates in complex real-world tasks.
Contribution
Introduction of TacThru sensor enabling simultaneous multimodal perception and TacThru-UMI framework leveraging these signals with a Transformer-based policy.
Findings
Achieved an average success rate of 85.5% on five tasks.
Outperformed tactile-only (66.3%) and vision-only (55.4%) policies.
Excelled in contact detection and precision manipulation scenarios.
Abstract
Robotic manipulation requires both rich multimodal perception and effective learning frameworks to handle complex real-world tasks. See-through-skin (STS) sensors, which combine tactile and visual perception, offer promising sensing capabilities, while modern imitation learning provides powerful tools for policy acquisition. However, existing STS designs lack simultaneous multimodal perception and suffer from unreliable tactile tracking. Furthermore, integrating these rich multimodal signals into learning-based manipulation pipelines remains an open challenge. We introduce TacThru, an STS sensor enabling simultaneous visual perception and robust tactile signal extraction, and TacThru-UMI, an imitation learning framework that leverages these multimodal signals for manipulation. Our sensor features a fully transparent elastomer, persistent illumination, novel keyline markers, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Sensor and Energy Harvesting Materials · Robot Manipulation and Learning · Tactile and Sensory Interactions
