Exploiting Point-Language Models with Dual-Prompts for 3D Anomaly   Detection

Jiaxiang Wang; Haote Xu; Xiaolu Chen; Haodi Xu; Yue Huang; Xinghao; Ding; Xiaotong Tu

arXiv:2502.11307·cs.CV·February 18, 2025

Exploiting Point-Language Models with Dual-Prompts for 3D Anomaly Detection

Jiaxiang Wang, Haote Xu, Xiaolu Chen, Haodi Xu, Yue Huang, Xinghao, Ding, Xiaotong Tu

PDF

Open Access

TL;DR

This paper introduces PLANE, a dual-prompt point-language model for 3D anomaly detection that generalizes across multiple categories with a single model, using dynamic prompts and pseudo 3D data augmentation.

Contribution

The paper presents a novel dual-prompt learning framework with dynamic prompt creation and pseudo 3D anomaly generation for improved multi-category 3D anomaly detection.

Findings

01

Achieves +8.7%/+17% detection and localization gains on Anomaly-ShapeNet.

02

Obtains +4.3%/+4.1% gains on Real3D-AD dataset.

03

Operates effectively under a multi-class-one-model paradigm.

Abstract

Anomaly detection (AD) in 3D point clouds is crucial in a wide range of industrial applications, especially in various forms of precision manufacturing. Considering the industrial demand for reliable 3D AD, several methods have been developed. However, most of these approaches typically require training separate models for each category, which is memory-intensive and lacks flexibility. In this paper, we propose a novel Point-Language model with dual-prompts for 3D ANomaly dEtection (PLANE). The approach leverages multi-modal prompts to extend the strong generalization capabilities of pre-trained Point-Language Models (PLMs) to the domain of 3D point cloud AD, achieving impressive detection performance across multiple categories using a single model. Specifically, we propose a dual-prompt learning method, incorporating both text and point cloud prompts. The method utilizes a dynamic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Modular Robots and Swarm Intelligence · Image Processing and 3D Reconstruction