Position-aware Guided Point Cloud Completion with CLIP Model
Feng Zhou, Qi Zhang, Ju Dai, Lei Li, Qing Fan, Junliang Xing

TL;DR
This paper introduces a multimodal point cloud completion method that leverages a position-aware module and the CLIP model to improve the accuracy and detail of completed 3D shapes, outperforming existing techniques.
Contribution
It proposes a novel multimodal framework with a position-aware module and utilizes CLIP for richer feature extraction in point cloud completion.
Findings
Outperforms state-of-the-art point cloud completion methods
Enhances spatial information of missing parts through a weighted map learning mechanism
Utilizes CLIP to incorporate detailed visual and textual information
Abstract
Point cloud completion aims to recover partial geometric and topological shapes caused by equipment defects or limited viewpoints. Current methods either solely rely on the 3D coordinates of the point cloud to complete it or incorporate additional images with well-calibrated intrinsic parameters to guide the geometric estimation of the missing parts. Although these methods have achieved excellent performance by directly predicting the location of complete points, the extracted features lack fine-grained information regarding the location of the missing area. To address this issue, we propose a rapid and efficient method to expand an unimodal framework into a multimodal framework. This approach incorporates a position-aware module designed to enhance the spatial information of the missing parts through a weighted map learning mechanism. In addition, we establish a Point-Text-Image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · 3D Shape Modeling and Analysis · Remote Sensing and LiDAR Applications
MethodsContrastive Language-Image Pre-training
