Learning to Feel the Future: DreamTacVLA for Contact-Rich Manipulation

Guo Ye; Zexi Zhang; Xu Zhao; Shang Wu; Haoran Lu; Shihan Lu; Han Liu

arXiv:2512.23864 · cs.RO · May 7, 2026

TL;DR

DreamTacVLA enhances vision-language-action models with contact physics understanding by integrating high-resolution tactile sensing, hierarchical perception, and future tactile prediction, significantly improving contact-rich manipulation performance.

Contribution

DreamTacVLA introduces a hierarchical perception framework that combines tactile world modeling with multi-scale sensory alignment for contact-aware robotic manipulation.

Findings

01 Achieves up to 95% success in contact-rich tasks.

02 Outperforms state-of-the-art VLA models.

03 Effectively models contact physics through tactile prediction.

Abstract

Vision-Language-Action (VLA) models have shown remarkable generalization by mapping web-scale knowledge to robotic control, yet they remain blind to physical contact. Consequently, they struggle with contact-rich manipulation tasks that require reasoning about force, texture, and slip. While some approaches incorporate low-dimensional tactile signals, they fail to capture the high-resolution dynamics essential for such interactions. To address this limitation, we introduce DreamTacVLA, a framework that grounds VLA models in contact physics by learning to feel the future. Our model adopts a hierarchical perception scheme in which high-resolution tactile images serve as micro-vision inputs coupled with wrist-camera local vision and third-person macro vision. To reconcile these multi-scale sensory streams, we first train a unified policy with a Hierarchical Spatial Alignment (HSA) loss…

Code & Models

Repositories

michaelyeah7/learning-to-feel-the-future (GitHub)
