Repurposing 3D Generative Model for Autoregressive Layout Generation

Haoran Feng; Yifan Niu; Zehuan Huang; Yang-Tian Sun; Chunchao Guo; Yuxin Peng; Lu Sheng

arXiv:2604.16299·cs.CV·April 20, 2026

Repurposing 3D Generative Model for Autoregressive Layout Generation

Haoran Feng, Yifan Niu, Zehuan Huang, Yang-Tian Sun, Chunchao Guo, Yuxin Peng, Lu Sheng

PDF

1 Repo

TL;DR

LaviGen is a novel framework that repurposes 3D generative models for autoregressive 3D layout generation, explicitly modeling geometric and physical relations to produce coherent scenes.

Contribution

It introduces a new autoregressive approach with an adapted 3D diffusion model and a dual-guidance distillation mechanism, achieving superior performance and efficiency.

Findings

01

19% higher physical plausibility than the state of the art

02

65% faster computation

03

Effective in generating coherent 3D scenes

Abstract

We introduce LaviGen, a framework that repurposes 3D generative models for 3D layout generation. Unlike previous methods that infer object layouts from textual descriptions, LaviGen operates directly in the native 3D space, formulating layout generation as an autoregressive process that explicitly models geometric relations and physical constraints among objects, producing coherent and physically plausible 3D scenes. To further enhance this process, we propose an adapted 3D diffusion model that integrates scene, object, and instruction information and employs a dual-guidance self-rollout distillation mechanism to improve efficiency and spatial accuracy. Extensive experiments on the LayoutVLM benchmark show LaviGen achieves superior 3D layout generation performance, with 19% higher physical plausibility than the state of the art and 65% faster computation. Our code is publicly available…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fenghora/LaviGen
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.