MetaEarth3D: Unlocking World-scale 3D Generation with Spatially Scalable Generative Modeling

Jinqi Cao; Zhiping Yu; Baihong Lin; Chenyang Liu; Zhenwei Shi; Zhengxia Zou

arXiv:2604.22828·cs.CV·April 28, 2026

MetaEarth3D: Unlocking World-scale 3D Generation with Spatially Scalable Generative Modeling

Jinqi Cao, Zhiping Yu, Baihong Lin, Chenyang Liu, Zhenwei Shi, Zhengxia Zou

PDF

TL;DR

MetaEarth3D introduces a groundbreaking generative model capable of creating spatially consistent 3D scenes at a planetary scale, advancing Earth observation and large-scale spatial understanding.

Contribution

It is the first model to incorporate spatial scale as a core dimension, enabling ultra-wide-area 3D generation across diverse terrains and urban environments.

Findings

01

Generated 3D scenes are both visually and geospatially realistic.

02

MetaEarth3D can produce unbounded, multi-level terrains and urban environments.

03

Built on 10 million real-world images, it demonstrates strong realism and diversity.

Abstract

Recent generative AI models have achieved remarkable breakthroughs in language and visual understanding. However, although these models can generate realistic visual content, their spatial scale remains confined to bounded environments, preventing them from capturing how geographic environments evolve across thousands of kilometers or from modeling the spatial structure of the large-scale physical world. This limitation poses a critical challenge for ultra-wide-area spatial intelligence in Earth observation and simulation, revealing a deeper gap in generative AI: progress has relied primarily on scaling model parameters and training data, while overlooking spatial scale as a core dimension of intelligence. Here, motivated by this missing dimension, we investigate spatial scale as a new scaling axis in foundation models and present MetaEarth3D, the first generative foundation model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.