EagleVision: A Multi-Task Benchmark for Cross-Domain Perception in High-Speed Autonomous Racing

Zakhar Yagudin; Murad Mebrahtu; Ren Jin; Jiaqi Huang; Yujia Yue; Dzmitry Tsetserukou; Jorge Dias; Majid Khonji

arXiv:2604.11400·cs.RO·April 14, 2026

EagleVision: A Multi-Task Benchmark for Cross-Domain Perception in High-Speed Autonomous Racing

Zakhar Yagudin, Murad Mebrahtu, Ren Jin, Jiaqi Huang, Yujia Yue, Dzmitry Tsetserukou, Jorge Dias, Majid Khonji

PDF

1 Repo

TL;DR

EagleVision introduces a comprehensive multi-task benchmark for high-speed autonomous racing perception, enabling systematic evaluation of cross-domain generalization in LiDAR-based detection and trajectory prediction.

Contribution

It provides a unified, annotated dataset and evaluation protocol for high-speed racing perception tasks, facilitating research on domain transfer and generalization.

Findings

01

Pretraining on urban data improves detection performance.

02

Intermediate pretraining on real racing data enhances transfer to racing domain.

03

Models trained on Indy data outperform in-domain models on trajectory prediction.

Abstract

High-speed autonomous racing presents extreme perception challenges, including large relative velocities and substantial domain shifts from conventional urban-driving datasets. Existing benchmarks do not adequately capture these high-dynamic conditions. We introduce EagleVision, a unified LiDAR-based multi-task benchmark for 3D detection and trajectory prediction in high-speed racing, providing newly annotated 3D bounding boxes for the Indy Autonomous Challenge dataset (14,893 frames) and the A2RL Real competition dataset (1,163 frames), together with 12,000 simulator-generated annotated frames, all standardized under a common evaluation protocol. Using a dataset-centric transfer framework, we quantify cross-domain generalization across urban, simulator, and real racing domains. Urban pretraining improves detection over scratch training (NDS 0.72 vs. 0.69), while intermediate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://avlab.io/EagleVision
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.