Dexbotic: Open-Source Vision-Language-Action Toolbox

Bin Xie; Erjin Zhou; Fan Jia; Hao Shi; Haoqiang Fan; Haowei Zhang; Hebei Li; Jianjian Sun; Jie Bin; Junwen Huang; Kai Liu; Kaixin Liu; Kefan Gu; Lin Sun; Meng Zhang; Peilong Han; Ruitao Hao; Ruitao Zhang; Saike Huang; Songhan Xie; Tiancai Wang; Tianle Liu; Wenbin Tang; Wenqi Zhu; Yang Chen; Yingfei Liu; Yizhuang Zhou; Yu Liu; Yucheng Zhao; Yunchao Ma; Yunfei Wei; Yuxiang Chen; Ze Chen; Zeming Li; Zhao Wu; Ziheng Zhang; Ziming Liu; Ziwei Yan; Ziyu Zhang

arXiv:2510.23511·cs.RO·October 28, 2025

Dexbotic: Open-Source Vision-Language-Action Toolbox

Bin Xie, Erjin Zhou, Fan Jia, Hao Shi, Haoqiang Fan, Haowei Zhang, Hebei Li, Jianjian Sun, Jie Bin, Junwen Huang, Kai Liu, Kaixin Liu, Kefan Gu, Lin Sun, Meng Zhang, Peilong Han, Ruitao Hao, Ruitao Zhang, Saike Huang, Songhan Xie, Tiancai Wang, Tianle Liu, Wenbin Tang, Wenqi Zhu

PDF

TL;DR

Dexbotic is an open-source, versatile toolbox for Vision-Language-Action research, supporting multiple policies, easy experiment development, and enhanced pretrained models to advance embodied intelligence studies.

Contribution

It introduces a comprehensive, open-source VLA toolbox with multi-policy support, easy experiment customization, and improved pretrained models for the research community.

Findings

01

Supports multiple VLA policies simultaneously

02

Enables quick development of new VLA experiments

03

Provides stronger pretrained models for better performance

Abstract

In this paper, we present Dexbotic, an open-source Vision-Language-Action (VLA) model toolbox based on PyTorch. It aims to provide a one-stop VLA research service for professionals in the field of embodied intelligence. It offers a codebase that supports multiple mainstream VLA policies simultaneously, allowing users to reproduce various VLA methods with just a single environment setup. The toolbox is experiment-centric, where the users can quickly develop new VLA experiments by simply modifying the Exp script. Moreover, we provide much stronger pretrained models to achieve great performance improvements for state-of-the-art VLA policies. Dexbotic will continuously update to include more of the latest pre-trained foundation models and cutting-edge VLA models in the industry.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.