ME-CPT: Multi-Task Enhanced Cross-Temporal Point Transformer for Urban   3D Change Detection

Luqi Zhang; Haiping Wang; Chong Liu; Zhen Dong; Bisheng Yang

arXiv:2501.14004·cs.CV·February 20, 2025

ME-CPT: Multi-Task Enhanced Cross-Temporal Point Transformer for Urban 3D Change Detection

Luqi Zhang, Haiping Wang, Chong Liu, Zhen Dong, Bisheng Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces ME-CPT, a novel multi-task transformer model that effectively captures 3D semantic changes in urban environments from multi-temporal point clouds, addressing existing challenges in change detection.

Contribution

The paper proposes a multi-task enhanced transformer network that models cross-temporal point cloud relationships and incorporates semantic segmentation to improve change detection accuracy.

Findings

01

ME-CPT outperforms existing methods on multiple datasets.

02

The approach effectively handles class imbalance in change detection.

03

A new 3D semantic change detection dataset covering 22.5 km² is released.

Abstract

The point clouds collected by the Airborne Laser Scanning (ALS) system provide accurate 3D information of urban land covers. By utilizing multi-temporal ALS point clouds, semantic changes in urban area can be captured, demonstrating significant potential in urban planning, emergency management, and infrastructure maintenance. Existing 3D change detection methods struggle to efficiently extract multi-class semantic information and change features, still facing the following challenges: (1) the difficulty of accurately modeling cross-temporal point clouds spatial relationships for effective change feature extraction; (2) class imbalance of change samples which hinders distinguishability of semantic features; (3) the lack of real-world datasets for 3D semantic change detection. To resolve these challenges, we propose the Multi-task Enhanced Cross-temporal Point Transformer (ME-CPT)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhangluqi0209/me-cpt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRemote Sensing and Land Use · Remote Sensing and LiDAR Applications · Automated Road and Building Extraction

MethodsAttention Is All You Need · Softmax · Residual Connection · Dropout · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing