Temporal Sampling Frequency Matters: A Capacity-Aware Study of End-to-End Driving Trajectory Prediction
Yumao Liu, Tao Liu, Xiangyu Li, Jiaxiang Li, Ke Ma

TL;DR
This study investigates how the frequency of camera frame sampling affects autonomous driving trajectory prediction, revealing that optimal sampling varies with model capacity and dataset.
Contribution
It introduces a capacity-aware analysis of sampling frequency, challenging the assumption that higher frequency always improves performance in end-to-end driving models.
Findings
Smaller models perform best at lower or intermediate sampling frequencies.
AutoVLA model achieves optimal performance at the highest sampling frequency.
Optimal sampling frequency depends on model size and dataset, not just data density.
Abstract
End to end (E2E) autonomous driving trajectory prediction is often trained with camera frames sampled at the highest available temporal frequency, assuming that denser sampling improves performance. We question this assumption by treating temporal sampling frequency as an explicit training set design variable. Starting from high frequency E2E driving datasets, we construct frequency sweep training sets by temporally subsampling camera frames along each trajectory. For each model dataset pair, we train and evaluate the same model under a fixed protocol, so the frequency response reflects how prediction performance changes with sampling frequency. We analyze this response from a capacity aware perspective. Sparse sampling may miss driving relevant cues, while dense sampling may add redundant visual content and off manifold noise. For finite capacity models, this can create a driving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
