MetaEarth3D: Unlocking World-scale 3D Generation with Spatially Scalable Generative Modeling
Jinqi Cao, Zhiping Yu, Baihong Lin, Chenyang Liu, Zhenwei Shi, Zhengxia Zou

TL;DR
MetaEarth3D introduces a groundbreaking generative model capable of creating spatially consistent 3D scenes at a planetary scale, advancing Earth observation and large-scale spatial understanding.
Contribution
It is the first model to incorporate spatial scale as a core dimension, enabling ultra-wide-area 3D generation across diverse terrains and urban environments.
Findings
Generated 3D scenes are both visually and geospatially realistic.
MetaEarth3D can produce unbounded, multi-level terrains and urban environments.
Built on 10 million real-world images, it demonstrates strong realism and diversity.
Abstract
Recent generative AI models have achieved remarkable breakthroughs in language and visual understanding. However, although these models can generate realistic visual content, their spatial scale remains confined to bounded environments, preventing them from capturing how geographic environments evolve across thousands of kilometers or from modeling the spatial structure of the large-scale physical world. This limitation poses a critical challenge for ultra-wide-area spatial intelligence in Earth observation and simulation, revealing a deeper gap in generative AI: progress has relied primarily on scaling model parameters and training data, while overlooking spatial scale as a core dimension of intelligence. Here, motivated by this missing dimension, we investigate spatial scale as a new scaling axis in foundation models and present MetaEarth3D, the first generative foundation model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
