Environment-Aware Adaptive Pruning with Interleaved Inference Orchestration for Vision-Language-Action Models
Yuting Huang, Leilei Ding, Zhipeng Tang, Zenghuan Zhu, Jiajun Deng, Xinrui Lin, Shuo Liu, Haojie Ren, Jianmin Ji, Yanyong Zhang

TL;DR
EcoVLA is a training-free, adaptive pruning framework for Vision-Language-Action models that dynamically adjusts sparsity based on environment changes, significantly improving inference speed with minimal accuracy loss.
Contribution
EcoVLA introduces a novel environment-aware adaptive pruning method combined with interleaved inference orchestration, enabling real-time VLA model acceleration without retraining.
Findings
Achieves up to 2.18× speedup with only 0.5% success rate drop.
Effectively adapts to environment changes during VLA inference.
Validated on diverse models, benchmarks, and real-world robots.
Abstract
While Vision-Language-Action (VLA) models hold promise in embodied intelligence, their large parameter counts lead to substantial inference latency that hinders real-time manipulation, motivating parameter sparsification. However, as the environment evolves during VLA execution, the optimal sparsity patterns change accordingly. Static pruning lacks the adaptability required for environment dynamics, whereas fixed-interval dynamic layer pruning suffers from coarse granularity and high retraining overheads. To bridge this gap, we propose EcoVLA, a training-free, plug-and-play adaptive pruning framework that supports orthogonal combination with existing VLA acceleration methods. EcoVLA comprises two components: Environment-aware Adaptive Pruning (EAP) and Interleaved Inference Orchestration (). EAP is a lightweight adaptive channel pruning method that incorporates the temporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
