BETTY Dataset: A Multi-modal Dataset for Full-Stack Autonomy
Micah Nye, Ayoub Raji, Andrew Saba, Eidan Erlich, Robert Exley, Aragya Goyal, Alexander Matros, Ritesh Misra, Matthew Sivaprakasam, Marko Bertogna, Deva Ramanan, Sebastian Scherer

TL;DR
The BETTY dataset provides a comprehensive, multi-modal collection of autonomous racing vehicle data over four years, supporting advanced research in full-stack autonomous systems including perception, state estimation, and dynamics modeling.
Contribution
It introduces a large-scale, multi-modal dataset with diverse environments and dynamic scenarios, enabling development and evaluation of full-stack autonomous vehicle algorithms.
Findings
Contains over 13 hours and 32TB of data from 6 racing environments.
Includes high-speed dynamic events like crashes and tire traction loss.
Supports training and testing of full autonomy pipelines.
Abstract
We present the BETTY dataset, a large-scale, multi-modal dataset collected on several autonomous racing vehicles, targeting supervised and self-supervised state estimation, dynamics modeling, motion forecasting, perception, and more. Existing large-scale datasets, especially autonomous vehicle datasets, focus primarily on supervised perception, planning, and motion forecasting tasks. Our work enables multi-modal, data-driven methods by including all sensor inputs and the outputs from the software stack, along with semantic metadata and ground truth information. The dataset encompasses 4 years of data, currently comprising over 13 hours and 32TB, collected on autonomous racing vehicle platforms. This data spans 6 diverse racing environments, including high-speed oval courses, for single and multi-agent algorithm evaluation in feature-sparse scenarios, as well as high-speed road courses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Vehicle Dynamics and Control Systems · Adversarial Robustness in Machine Learning
