Multi-dataset synergistic in supervised learning to pre-label structural components in point clouds from shell construction scenes
Lukas Rauch, Thomas Braml

TL;DR
This paper presents a method using pre-trained transformer models and transfer learning to efficiently segment complex structural components in point clouds from shell construction scenes, reducing annotation effort.
Contribution
It adapts existing datasets and transformer architectures for the challenging task of structural component segmentation in construction site point clouds, demonstrating effective transfer learning.
Findings
Pre-trained transformers achieve effective segmentation with minimal fine-tuning.
Transfer learning enhances performance on new construction site data.
Method reduces annotation effort for large-scale construction datasets.
Abstract
The significant effort required to annotate data for new training datasets hinders computer vision research and machine learning in the construction industry. This work explores adapting standard datasets and the latest transformer model architectures for point cloud semantic segmentation in the context of shell construction sites. Unlike common approaches focused on object segmentation of building interiors and furniture, this study addressed the challenges of segmenting complex structural components in Architecture, Engineering, and Construction (AEC). We establish a baseline through supervised training and a custom validation dataset, evaluate the cross-domain inference with large-scale indoor datasets, and utilize transfer learning to maximize segmentation performance with minimal new data. The findings indicate that with minimal fine-tuning, pre-trained transformer architectures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · 3D Surveying and Cultural Heritage
