Geometry-Aware Network for Domain Adaptive Semantic Segmentation

Yinghong Liao; Wending Zhou; Xu Yan; Shuguang Cui; Yizhou Yu; Zhen Li

arXiv:2212.00920·cs.CV·December 6, 2022

Geometry-Aware Network for Domain Adaptive Semantic Segmentation

Yinghong Liao, Wending Zhou, Xu Yan, Shuguang Cui, Yizhou Yu, Zhen Li

PDF

Open Access

TL;DR

This paper introduces GANDA, a geometry-aware network that uses 3D point cloud representations and domain-invariant geometric adaptation to improve unsupervised domain adaptation in semantic segmentation, outperforming existing methods.

Contribution

The novel GANDA framework leverages 3D geometric information and point cloud topology for better domain adaptation in semantic segmentation tasks.

Findings

01

Outperforms state-of-the-art on GTA5->Cityscapes and SYNTHIA->Cityscapes benchmarks.

02

Effectively utilizes 3D topology for pseudo-label refinement.

03

Enhances 2D classifier with domain-invariant geometric features.

Abstract

Measuring and alleviating the discrepancies between the synthetic (source) and real scene (target) data is the core issue for domain adaptive semantic segmentation. Though recent works have introduced depth information in the source domain to reinforce the geometric and semantic knowledge transfer, they cannot extract the intrinsic 3D information of objects, including positions and shapes, merely based on 2D estimated depth. In this work, we propose a novel Geometry-Aware Network for Domain Adaptation (GANDA), leveraging more compact 3D geometric point cloud representations to shrink the domain gaps. In particular, we first utilize the auxiliary depth supervision from the source domain to obtain the depth prediction in the target domain to accomplish structure-texture disentanglement. Beyond depth estimation, we explicitly exploit 3D topology on the point clouds generated from RGB-D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Vision and Imaging · Human Pose and Action Recognition