ME-CPT: Multi-Task Enhanced Cross-Temporal Point Transformer for Urban 3D Change Detection
Luqi Zhang, Haiping Wang, Chong Liu, Zhen Dong, Bisheng Yang

TL;DR
This paper introduces ME-CPT, a novel multi-task transformer model that effectively captures 3D semantic changes in urban environments from multi-temporal point clouds, addressing existing challenges in change detection.
Contribution
The paper proposes a multi-task enhanced transformer network that models cross-temporal point cloud relationships and incorporates semantic segmentation to improve change detection accuracy.
Findings
ME-CPT outperforms existing methods on multiple datasets.
The approach effectively handles class imbalance in change detection.
A new 3D semantic change detection dataset covering 22.5 km² is released.
Abstract
The point clouds collected by the Airborne Laser Scanning (ALS) system provide accurate 3D information of urban land covers. By utilizing multi-temporal ALS point clouds, semantic changes in urban area can be captured, demonstrating significant potential in urban planning, emergency management, and infrastructure maintenance. Existing 3D change detection methods struggle to efficiently extract multi-class semantic information and change features, still facing the following challenges: (1) the difficulty of accurately modeling cross-temporal point clouds spatial relationships for effective change feature extraction; (2) class imbalance of change samples which hinders distinguishability of semantic features; (3) the lack of real-world datasets for 3D semantic change detection. To resolve these challenges, we propose the Multi-task Enhanced Cross-temporal Point Transformer (ME-CPT)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing and Land Use · Remote Sensing and LiDAR Applications · Automated Road and Building Extraction
MethodsAttention Is All You Need · Softmax · Residual Connection · Dropout · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing
