Hierarchical Image-Guided 3D Point Cloud Segmentation in Industrial Scenes via Multi-View Bayesian Fusion
Yu Zhu, Naoya Chiba, Koichi Hashimoto

TL;DR
This paper introduces a hierarchical, image-guided 3D segmentation framework for industrial scenes that refines segmentation from instance to part level using multi-view Bayesian fusion, effectively handling occlusion and complexity.
Contribution
The proposed method uniquely combines multi-view image rendering, SAM and YOLO-World prompts, and Bayesian fusion for accurate, annotation-efficient 3D segmentation in complex industrial environments.
Findings
Achieves high per-class mIoU scores on real-world factory data.
Effectively handles occlusion and structural complexity.
Demonstrates robustness and generalization on public datasets.
Abstract
Reliable 3D segmentation is critical for understanding complex scenes with dense layouts and multi-scale objects, as commonly seen in industrial environments. In such scenarios, heavy occlusion weakens geometric boundaries between objects, and large differences in object scale will cause end-to-end models fail to capture both coarse and fine details accurately. Existing 3D point-based methods require costly annotations, while image-guided methods often suffer from semantic inconsistencies across views. To address these challenges, we propose a hierarchical image-guided 3D segmentation framework that progressively refines segmentation from instance-level to part-level. Instance segmentation involves rendering a top-view image and projecting SAM-generated masks prompted by YOLO-World back onto the 3D point cloud. Part-level segmentation is subsequently performed by rendering multi-view…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Robotics and Sensor-Based Localization · Advanced Neural Network Applications
