Geometry-Aware Network for Domain Adaptive Semantic Segmentation
Yinghong Liao, Wending Zhou, Xu Yan, Shuguang Cui, Yizhou Yu, Zhen Li

TL;DR
This paper introduces GANDA, a geometry-aware network that uses 3D point cloud representations and domain-invariant geometric adaptation to improve unsupervised domain adaptation in semantic segmentation, outperforming existing methods.
Contribution
The novel GANDA framework leverages 3D geometric information and point cloud topology for better domain adaptation in semantic segmentation tasks.
Findings
Outperforms state-of-the-art on GTA5->Cityscapes and SYNTHIA->Cityscapes benchmarks.
Effectively utilizes 3D topology for pseudo-label refinement.
Enhances 2D classifier with domain-invariant geometric features.
Abstract
Measuring and alleviating the discrepancies between the synthetic (source) and real scene (target) data is the core issue for domain adaptive semantic segmentation. Though recent works have introduced depth information in the source domain to reinforce the geometric and semantic knowledge transfer, they cannot extract the intrinsic 3D information of objects, including positions and shapes, merely based on 2D estimated depth. In this work, we propose a novel Geometry-Aware Network for Domain Adaptation (GANDA), leveraging more compact 3D geometric point cloud representations to shrink the domain gaps. In particular, we first utilize the auxiliary depth supervision from the source domain to obtain the depth prediction in the target domain to accomplish structure-texture disentanglement. Beyond depth estimation, we explicitly exploit 3D topology on the point clouds generated from RGB-D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Vision and Imaging · Human Pose and Action Recognition
