AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic   Segmentation via Adaptive Label Correction

Pufan Zou; Shijia Zhao; Weijie Huang; Qiming Xia; Chenglu Wen; Wei Li,; Cheng Wang

arXiv:2412.18255·cs.CV·December 25, 2024

AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction

Pufan Zou, Shijia Zhao, Weijie Huang, Qiming Xia, Chenglu Wen, Wei Li,, Cheng Wang

PDF

Open Access

TL;DR

AdaCo is a novel label-free learning approach that enhances 3D semantic segmentation by adaptively correcting noisy labels using cross-modal supervision, noise adjustment, and robust loss functions, outperforming existing methods on outdoor datasets.

Contribution

Introduces AdaCo, a new label-free learning framework with adaptive label correction, cross-modal supervision, and robust loss for improved 3D semantic segmentation in outdoor environments.

Findings

01

Outperforms existing methods on outdoor datasets

02

Effectively mitigates noise in supervision signals

03

Enhances 3D segmentation accuracy with adaptive techniques

Abstract

Recently, Visual Foundation Models (VFMs) have shown a remarkable generalization performance in 3D perception tasks. However, their effectiveness in large-scale outdoor datasets remains constrained by the scarcity of accurate supervision signals, the extensive noise caused by variable outdoor conditions, and the abundance of unknown objects. In this work, we propose a novel label-free learning method, Adaptive Label Correction (AdaCo), for 3D semantic segmentation. AdaCo first introduces the Cross-modal Label Generation Module (CLGM), providing cross-modal supervision with the formidable interpretive capabilities of the VFMs. Subsequently, AdaCo incorporates the Adaptive Noise Corrector (ANC), updating and adjusting the noisy samples within this supervision iteratively during training. Moreover, we develop an Adaptive Robust Loss (ARL) function to modulate each sample's sensitivity to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Video Analysis and Summarization · Machine Learning and Data Classification