DPG-CD: Depth-Prior-Guided Cross-Modal Joint 2D-3D Change Detection
Luqi Zhang, Zhen Dong, Bisheng Yang

TL;DR
DPG-CD introduces a depth-prior-guided fusion framework for joint 2D semantic and 3D height change detection using multi-temporal cross-modal data, addressing spectral-geometric gaps and modality differences.
Contribution
It proposes a novel depth prior integration and multi-stage fusion architecture for effective 2D-3D change detection in urban environments.
Findings
Outperforms state-of-the-art methods on public datasets.
Effectively fuses semantic and geometric features for change detection.
Improves structural consistency and height estimation accuracy.
Abstract
Urban spatial evolution is manifested not only through horizontal expansion but also through vertical structural changes. Consequently, jointly capturing 2D semantic changes and 3D height changes is essential for urban morphology analysis and emergency management. In practical scenarios, collecting 3D observations is often constrained by high acquisition costs and the inability to support frequent updates. The multi-temporal cross-modal input consisting of pre-event Digital Surface Model (DSM) and post-event imagery provides a practical solution for 3D change detection in high-frequency urban monitoring, disaster assessment, and emergency response scenarios. However, this setting remains challenging as imagery and DSM data exhibit significant spectral-geometric representation gaps. Moreover, modality differences may be confused with actual changes, and robust change detection requires…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
