TL;DR
This paper introduces a health-conditioned vision-language-action model for robots that adapts to physical failures by incorporating a health vector, improving task success with degraded joints.
Contribution
The authors propose a novel health-aware VLA model with a health projector module, trained on malfunction data, enabling robots to adapt to joint degradations.
Findings
Model successfully operates with various degraded joint configurations.
Lightweight addition enables adaptation without retraining the entire model.
Code and dataset will be publicly available at the provided GitHub link.
Abstract
Research on Vision Language Action (VLA) models has been increasing rapidly in recent years. Although some of them focus on detecting, preventing, and recovering from task failures, they usually don't deal with adapting to robot's physical failures. In real-life scenarios, most robots face physical degradations in various ways such as joint degradation, actuator failure, or weak gripper. We introduce malfunction-aware (health-conditioned) VLA that takes a health vector as an input that gives information about robots' joints' operation angle and torque capability, and adapts its predictions to complete the tasks with the degraded joints. To achieve this, we inject a Health Projector module to the VLA-Adapter architecture and train it on malfunction robot data we collected on the LIBERO environment [1]. We collect 128 teleoperated episodes on Libero-Spatial tasks. Our results show that,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
