LaGen: Towards Autoregressive LiDAR Scene Generation

Sizhuo Zhou; Xiaosong Jia; Fanrui Zhang; Junjie Li; Juyong Zhang; Yukang Feng; Jianwen Sun; Songbur Wong; Junqi You; Junchi Yan

arXiv:2511.21256·cs.CV·November 27, 2025

LaGen: Towards Autoregressive LiDAR Scene Generation

Sizhuo Zhou, Xiaosong Jia, Fanrui Zhang, Junjie Li, Juyong Zhang, Yukang Feng, Jianwen Sun, Songbur Wong, Junqi You, Junchi Yan

PDF

Open Access

TL;DR

LaGen is a novel autoregressive framework for long-horizon LiDAR scene generation that effectively uses single-frame input and bounding box conditions to produce high-fidelity 4D point clouds, advancing autonomous driving simulation.

Contribution

Introduces LaGen, the first autoregressive model for frame-by-frame LiDAR scene generation, with modules for scene decoupling and noise modulation to improve interactivity and long-term accuracy.

Findings

01

Outperforms existing LiDAR generation models on nuScenes

02

Effective in long-horizon scene generation with high fidelity

03

Enhances object-level interactivity in LiDAR scene synthesis

Abstract

Generative world models for autonomous driving (AD) have become a trending topic. Unlike the widely studied image modality, in this work we explore generative world models for LiDAR data. Existing generation methods for LiDAR data only support single frame generation, while existing prediction approaches require multiple frames of historical input and can only deterministically predict multiple frames at once, lacking interactivity. Both paradigms fail to support long-horizon interactive generation. To this end, we introduce LaGen, which to the best of our knowledge is the first framework capable of frame-by-frame autoregressive generation of long-horizon LiDAR scenes. LaGen is able to take a single-frame LiDAR input as a starting point and effectively utilize bounding box information as conditions to generate high-fidelity 4D scene point clouds. In addition, we introduce a scene…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · 3D Shape Modeling and Analysis