VertiFormer: A Data-Efficient Multi-Task Transformer for Off-Road Robot   Mobility

Mohammad Nazeri; Anuj Pokhrel; Alexandyr Card; Aniket Datar; Garrett; Warnell; and Xuesu Xiao

arXiv:2502.00543·cs.RO·February 4, 2025

VertiFormer: A Data-Efficient Multi-Task Transformer for Off-Road Robot Mobility

Mohammad Nazeri, Anuj Pokhrel, Alexandyr Card, Aniket Datar, Garrett, Warnell, and Xuesu Xiao

PDF

Open Access 1 Repo

TL;DR

VertiFormer is a data-efficient multi-task Transformer that enables off-road robot mobility understanding using only one hour of data, employing novel modeling techniques to predict poses, actions, and terrain patches.

Contribution

The paper introduces VertiFormer, a novel Transformer architecture specifically designed for off-road robot mobility with limited data and multi-task capabilities.

Findings

01

Effective with only one hour of data

02

Supports multiple off-road mobility tasks

03

Operates efficiently on physical robots

Abstract

Sophisticated learning architectures, e.g., Transformers, present a unique opportunity for robots to understand complex vehicle-terrain kinodynamic interactions for off-road mobility. While internet-scale data are available for Natural Language Processing (NLP) and Computer Vision (CV) tasks to train Transformers, real-world mobility data are difficult to acquire with physical robots navigating off-road terrain. Furthermore, training techniques specifically designed to process text and image data in NLP and CV may not apply to robot mobility. In this paper, we propose VertiFormer, a novel data-efficient multi-task Transformer model trained with only one hour of data to address such challenges of applying Transformer architectures for robot mobility on extremely rugged, vertically challenging, off-road terrain. Specifically, VertiFormer employs a new learnable masked modeling and next…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mhnazeri/vertiformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmbedded Systems Design Techniques · Real-time simulation and control systems · Modular Robots and Swarm Intelligence

MethodsAttention Is All You Need · Label Smoothing · Layer Normalization · Linear Layer · Byte Pair Encoding · Dense Connections · Residual Connection · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam