Position-aware Guided Point Cloud Completion with CLIP Model

Feng Zhou; Qi Zhang; Ju Dai; Lei Li; Qing Fan; Junliang Xing

arXiv:2412.08271·cs.CV·December 12, 2024

Position-aware Guided Point Cloud Completion with CLIP Model

Feng Zhou, Qi Zhang, Ju Dai, Lei Li, Qing Fan, Junliang Xing

PDF

Open Access

TL;DR

This paper introduces a multimodal point cloud completion method that leverages a position-aware module and the CLIP model to improve the accuracy and detail of completed 3D shapes, outperforming existing techniques.

Contribution

It proposes a novel multimodal framework with a position-aware module and utilizes CLIP for richer feature extraction in point cloud completion.

Findings

01

Outperforms state-of-the-art point cloud completion methods

02

Enhances spatial information of missing parts through a weighted map learning mechanism

03

Utilizes CLIP to incorporate detailed visual and textual information

Abstract

Point cloud completion aims to recover partial geometric and topological shapes caused by equipment defects or limited viewpoints. Current methods either solely rely on the 3D coordinates of the point cloud to complete it or incorporate additional images with well-calibrated intrinsic parameters to guide the geometric estimation of the missing parts. Although these methods have achieved excellent performance by directly predicting the location of complete points, the extracted features lack fine-grained information regarding the location of the missing area. To address this issue, we propose a rapid and efficient method to expand an unimodal framework into a multimodal framework. This approach incorporates a position-aware module designed to enhance the spatial information of the missing parts through a weighted map learning mechanism. In addition, we establish a Point-Text-Image…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Surveying and Cultural Heritage · 3D Shape Modeling and Analysis · Remote Sensing and LiDAR Applications

MethodsContrastive Language-Image Pre-training