TL;DR
This paper improves the efficiency of reasoning-based end-to-end autonomous driving systems by redesigning Alpamayo 1 into a single-reasoning architecture and optimizing diffusion-based action generation, significantly reducing inference latency.
Contribution
It systematically analyzes Alpamayo 1's architecture, demonstrating that single-reasoning maintains diversity and accelerates inference, with practical optimizations reducing latency by over 69%.
Findings
Replacing multi-reasoning with single-reasoning preserves trajectory diversity.
Optimizations eliminate inter-block overhead, accelerating diffusion-based generation.
Achieved a 69.23% reduction in inference latency without sacrificing performance.
Abstract
Reasoning-based end-to-end (E2E) autonomous driving has recently emerged as a promising approach to improving the interpretability of driving decisions as it can generate human-readable reasoning together with predicted trajectories. Such approaches commonly generate multiple trajectories to capture diverse future behaviors, and they fall into two categories: (1) multi-reasoning, where one reasoning sequence is generated per trajectory, and (2) single-reasoning, where a single reasoning is shared across all trajectories. The former offers richer diversity at the cost of redundant computation, while the latter is more efficient but is often assumed to sacrifice diversity. Alpamayo 1, a representative system, adopts the multi-reasoning approach and achieves competitive trajectory prediction performance. However, the efficiency of this design remains largely unexplored, making it a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
