TL;DR
EagleVision introduces a comprehensive multi-task benchmark for high-speed autonomous racing perception, enabling systematic evaluation of cross-domain generalization in LiDAR-based detection and trajectory prediction.
Contribution
It provides a unified, annotated dataset and evaluation protocol for high-speed racing perception tasks, facilitating research on domain transfer and generalization.
Findings
Pretraining on urban data improves detection performance.
Intermediate pretraining on real racing data enhances transfer to racing domain.
Models trained on Indy data outperform in-domain models on trajectory prediction.
Abstract
High-speed autonomous racing presents extreme perception challenges, including large relative velocities and substantial domain shifts from conventional urban-driving datasets. Existing benchmarks do not adequately capture these high-dynamic conditions. We introduce EagleVision, a unified LiDAR-based multi-task benchmark for 3D detection and trajectory prediction in high-speed racing, providing newly annotated 3D bounding boxes for the Indy Autonomous Challenge dataset (14,893 frames) and the A2RL Real competition dataset (1,163 frames), together with 12,000 simulator-generated annotated frames, all standardized under a common evaluation protocol. Using a dataset-centric transfer framework, we quantify cross-domain generalization across urban, simulator, and real racing domains. Urban pretraining improves detection over scratch training (NDS 0.72 vs. 0.69), while intermediate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
