Matrix-3D: Omnidirectional Explorable 3D World Generation

Zhongqi Yang; Wenhang Ge; Yuqi Li; Jiaqi Chen; Haoyuan Li; Mengyin An; Fei Kang; Hua Xue; Baixin Xu; Yuyang Yin; Eric Li; Yang Liu; Yikai Wang; Hao-Xiang Guo; Yahui Zhou

arXiv:2508.08086·cs.CV·August 12, 2025

Matrix-3D: Omnidirectional Explorable 3D World Generation

Zhongqi Yang, Wenhang Ge, Yuqi Li, Jiaqi Chen, Haoyuan Li, Mengyin An, Fei Kang, Hua Xue, Baixin Xu, Yuyang Yin, Eric Li, Yang Liu, Yikai Wang, Hao-Xiang Guo, Yahui Zhou

PDF

Open Access 1 Models

TL;DR

Matrix-3D introduces a panoramic-based framework for wide-coverage, explorable 3D world generation from a single image or text, combining novel diffusion models and reconstruction methods to improve scene quality and geometric consistency.

Contribution

The paper presents a new panoramic video diffusion model and two 3D reconstruction methods, along with a large-scale synthetic dataset, advancing 3D scene generation from limited inputs.

Findings

01

Achieves state-of-the-art panoramic video generation quality

02

Enables high-quality, geometrically consistent 3D scene reconstruction

03

Demonstrates effective 3D world generation from minimal input data

Abstract

Explorable 3D world generation from a single image or text prompt forms a cornerstone of spatial intelligence. Recent works utilize video model to achieve wide-scope and generalizable 3D world generation. However, existing approaches often suffer from a limited scope in the generated scenes. In this work, we propose Matrix-3D, a framework that utilize panoramic representation for wide-coverage omnidirectional explorable 3D world generation that combines conditional video generation and panoramic 3D reconstruction. We first train a trajectory-guided panoramic video diffusion model that employs scene mesh renders as condition, to enable high-quality and geometrically consistent scene video generation. To lift the panorama scene video to 3D world, we propose two separate methods: (1) a feed-forward large panorama reconstruction model for rapid 3D scene reconstruction and (2) an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
Skywork/Matrix-3D
model· ♡ 50
♡ 50

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis