CompliantVLA-adaptor: VLM-Guided Variable Impedance Action for Safe Contact-Rich Manipulation

Heng Zhang; Wei-Hsing Huang; Qiyi Tong; Gokhan Solak; Puze Liu; Kaidi Zhang; Sheng Liu; Jan Peters; Yu She; Arash Ajoudani

arXiv:2601.15541·cs.RO·March 18, 2026

CompliantVLA-adaptor: VLM-Guided Variable Impedance Action for Safe Contact-Rich Manipulation

Heng Zhang, Wei-Hsing Huang, Qiyi Tong, Gokhan Solak, Puze Liu, Kaidi Zhang, Sheng Liu, Jan Peters, Yu She, Arash Ajoudani

PDF

Open Access

TL;DR

This paper introduces CompliantVLA-adaptor, which enhances vision-language action models with VLM-informed variable impedance control to improve safety and success in contact-rich robotic manipulation tasks.

Contribution

It presents a novel method combining VLM-based context understanding with real-time impedance regulation for safer contact-rich manipulation.

Findings

01

Outperforms baseline VLA models in success rates

02

Reduces force violations during tasks

03

Effective in both simulation and real-world scenarios

Abstract

We propose a CompliantVLA-adaptor that augments the state-of-the-art Vision-Language-Action (VLA) models with vision-language model (VLM)-informed context-aware variable impedance control (VIC) to improve the safety and effectiveness of contact-rich robotic manipulation tasks. Existing VLA systems (e.g., RDT, Pi0.5, OpenVLA-oft) typically output position, but lack force-aware adaptation, leading to unsafe or failed interactions in physical tasks involving contact, compliance, or uncertainty. In the proposed CompliantVLA-adaptor, a VLM interprets task context from images and natural language to adapt the stiffness and damping parameters of a VIC controller. These parameters are further regulated using real-time force/torque feedback to ensure interaction forces remain within safe thresholds. We demonstrate that our method outperforms the VLA baselines on a suite of complex contact-rich…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Teleoperation and Haptic Systems · Social Robot Interaction and HRI