Tube Diffusion Policy: Reactive Visual-Tactile Policy Learning for Contact-rich Manipulation
Teng Xue, Alberto Rigo, Bingjian Huang, Jiayi Shen, Zhengtong Xu, Nick Colonnese, Amirhossein H. Memar

TL;DR
This paper introduces Tube Diffusion Policy (TDP), a reactive visual-tactile learning framework that enhances contact-rich manipulation by enabling rapid, adaptive responses through generative models and tube-based feedback control.
Contribution
TDP combines diffusion-based imitation learning with tube-based feedback control to improve reactivity and robustness in contact-rich manipulation tasks.
Findings
TDP outperforms state-of-the-art imitation learning methods on multiple benchmarks.
Real-world experiments demonstrate TDP's robustness under contact uncertainty and disturbances.
Action tube mechanism reduces denoising steps, enabling real-time control.
Abstract
Contact-rich manipulation is central to many everyday human activities, requiring continuous adaptation to contact uncertainty and external disturbances through multi-modal perception, particularly vision and tactile feedback. While imitation learning has shown strong potential for learning complex manipulation behaviors, most existing approaches rely on action chunking, which fundamentally limits their ability to react to unforeseen observations during execution. This limitation becomes especially critical in contact-rich scenarios, where physical uncertainty and high-frequency tactile feedback demand rapid, reactive control. To address this challenge, we propose Tube Diffusion Policy (TDP), a novel reactive visual-tactile policy learning framework that bridges diffusion-based imitation learning with tube-based feedback control. By leveraging the expressive power of generative models,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
