MGPC: Multimodal Network for Generalizable Point Cloud Completion With Modality Dropout and Progressive Decoding
Jiangyuan Liu, Yuhao Zhao, Hongxuan Ma, Zhe Liu, Jian Wang, and Wei Zou

TL;DR
MGPC is a multimodal framework for point cloud completion that combines point clouds, images, and text, using dropout, Transformer fusion, and progressive decoding to enhance robustness and generalization in real-world scenarios.
Contribution
The paper introduces MGPC, a novel multimodal point cloud completion model with modality dropout, Transformer fusion, and progressive decoding, plus a large-scale benchmark MGPC-1M.
Findings
Outperforms prior methods on MGPC-1M and in-the-wild data.
Demonstrates strong generalization to real-world scenarios.
Shows robustness and scalability in diverse conditions.
Abstract
Point cloud completion aims to recover complete 3D geometry from partial observations caused by limited viewpoints and occlusions. Existing learning-based works, including 3D Convolutional Neural Network (CNN)-based, point-based, and Transformer-based methods, have achieved strong performance on synthetic benchmarks. However, due to the limitations of modality, scalability, and generative capacity, their generalization to novel objects and real-world scenarios remains challenging. In this paper, we propose MGPC, a generalizable multimodal point cloud completion framework that integrates point clouds, RGB images, and text within a unified architecture. MGPC introduces an innovative modality dropout strategy, a Transformer-based fusion module, and a novel progressive generator to improve robustness, scalability, and geometric modeling capability. We further develop an automatic data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Advanced Numerical Analysis Techniques · 3D Surveying and Cultural Heritage
