OmniVTA: Visuo-Tactile World Modeling for Contact-Rich Robotic Manipulation
Yuhang Zheng, Songen Gu, Weize Li, Yupeng Zheng, Yujie Zang, Shuai Tian, Xiang Li, Ce Hao, Chen Gao, Si Liu, Haoran Li, Yilun Chen, Shuicheng Yan, Wenchao Ding

TL;DR
OmniVTA introduces a large-scale visuo-tactile dataset and a novel world-model framework that enhances contact-rich robotic manipulation by integrating predictive contact modeling with high-frequency tactile feedback.
Contribution
The paper presents OmniViTac, a comprehensive visuo-tactile dataset, and OmniVTA, a new world-model-based framework that improves manipulation performance through integrated contact dynamics modeling.
Findings
OmniVTA outperforms existing methods in contact-rich tasks.
The framework generalizes well to unseen objects and configurations.
Closed-loop tactile feedback improves manipulation accuracy.
Abstract
Contact-rich manipulation tasks, such as wiping and assembly, require accurate perception of contact forces, friction changes, and state transitions that cannot be reliably inferred from vision alone. Despite growing interest in visuo-tactile manipulation, progress is constrained by two persistent limitations: existing datasets are small in scale and narrow in task coverage, and current methods treat tactile signals as passive observations rather than using them to model contact dynamics or enable closed-loop control explicitly. In this paper, we present \textbf{OmniViTac}, a large-scale visuo-tactile-action dataset comprising trajectories across tasks and objects, organized into six physics-grounded interaction patterns. Building on this dataset, we propose \textbf{OmniVTA}, a world-model-based visuo-tactile manipulation framework that integrates four tightly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Tactile and Sensory Interactions · Advanced Sensor and Energy Harvesting Materials
