Depth-Enhanced Feature Pyramid Network for Occlusion-Aware Verification of Buildings from Oblique Images
Qing Zhu, Shengzhi Huang, Han Hu, Haifeng Li, Min Chen and, Ruofei Zhong

TL;DR
This paper introduces a depth-enhanced feature pyramid network that fuses color and depth data from oblique images to improve the accuracy of building change detection in urban environments, effectively handling occlusions and scale variations.
Contribution
It proposes a novel fused feature pyramid network utilizing both color and depth data, along with multi-view voting, to enhance building change detection accuracy from oblique images.
Findings
Outperforms ResNet and EfficientNet in recall and precision.
Successfully detects all changed buildings, reducing manual verification.
Depth data and multi-view strategies significantly improve detection performance.
Abstract
Detecting the changes of buildings in urban environments is essential. Existing methods that use only nadir images suffer from severe problems of ambiguous features and occlusions between buildings and other regions. Furthermore, buildings in urban environments vary significantly in scale, which leads to performance issues when using single-scale features. To solve these issues, this paper proposes a fused feature pyramid network, which utilizes both color and depth data for the 3D verification of existing buildings 2D footprints from oblique images. First, the color data of oblique images are enriched with the depth information rendered from 3D mesh models. Second, multiscale features are fused in the feature pyramid network to convolve both the color and depth data. Finally, multi-view information from both the nadir and oblique images is used in a robust voting procedure to label…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDepthwise Convolution · Max Pooling · Bottleneck Residual Block · Residual Connection · Pointwise Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Depthwise Separable Convolution · Kaiming Initialization · Batch Normalization · Residual Block
