MoRE: 3D Visual Geometry Reconstruction Meets Mixture-of-Experts

Jingnan Gao; Zhe Wang; Xianze Fang; Xingyu Ren; Zhuo Chen; Shengqi Liu; Yuhao Cheng; Jiangjing Lyu; Xiaokang Yang; Yichao Yan

arXiv:2510.27234 · cs.CV · November 3, 2025

Open Access

TL;DR

MoRE is a scalable 3D visual foundation model using a Mixture-of-Experts architecture that improves geometric reconstruction robustness and accuracy across diverse tasks and real-world conditions.

Contribution

The paper introduces MoRE, a dense 3D visual foundation model that combines a Mixture-of-Experts (MoE) architecture, confidence-based depth refinement, and semantic integration to improve scalability and task adaptability.

Findings

01. Achieves state-of-the-art results on multiple benchmarks.
02. Supports diverse 3D tasks without additional computation.
03. Improves robustness and accuracy in real-world scenarios.

Abstract

Recent advances in language and vision have demonstrated that scaling up model capacity consistently improves performance across diverse tasks. In 3D visual geometry reconstruction, large-scale training has likewise proven effective for learning versatile representations. However, further scaling of 3D models is challenging due to the complexity of geometric supervision and the diversity of 3D data. To overcome these limitations, we propose MoRE, a dense 3D visual foundation model based on a Mixture-of-Experts (MoE) architecture that dynamically routes features to task-specific experts, allowing them to specialize in complementary data aspects and enhance both scalability and adaptability. Aiming to improve robustness under real-world conditions, MoRE incorporates a confidence-based depth refinement module that stabilizes and refines geometric estimation. In addition, it integrates…
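The routing idea described in the abstract can be illustrated with a generic sparse top-k MoE layer. This is a minimal sketch, not the paper's implementation: the abstract does not specify the router, the expert design, or the number of active experts, so the linear gate, the single-matrix experts, `top_k=2`, and all names below (`moe_layer`, `gate_W`, `experts`) are assumptions chosen for illustration.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def moe_layer(token, gate_W, experts, top_k=2):
    """Route one token feature to its top_k experts and mix their outputs.

    Assumption: the gate is a single linear map and each expert is a
    single linear map; real MoE layers typically use small MLP experts.
    """
    logits = matvec(gate_W, token)  # one routing logit per expert
    # Indices of the top_k highest-scoring experts.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:top_k]
    # Renormalise the gate over the selected experts only.
    weights = softmax([logits[i] for i in top])
    out = [0.0] * len(token)
    for w, i in zip(weights, top):
        y = matvec(experts[i], token)  # only the selected experts are evaluated
        out = [o + w * yi for o, yi in zip(out, y)]
    return out, top, weights


def random_matrix(rows, cols, rng):
    """Hypothetical initialiser: uniform weights in [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
dim, num_experts = 4, 3
gate_W = random_matrix(num_experts, dim, rng)
experts = [random_matrix(dim, dim, rng) for _ in range(num_experts)]

token = [0.5, -1.0, 0.25, 2.0]
output, chosen, weights = moe_layer(token, gate_W, experts, top_k=2)
print("selected experts:", chosen)
print("gate weights:", weights)
```

Because only `top_k` of the experts run for each token, total parameter count can grow with the number of experts while per-token compute stays roughly constant, which is the usual scalability argument for MoE layers.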

Taxonomy

Topics: 3D Shape Modeling and Analysis · Advanced Vision and Imaging · Robotics and Sensor-Based Localization