360SFUDA++: Towards Source-free UDA for Panoramic Segmentation by   Learning Reliable Category Prototypes

Xu Zheng; Pengyuan Zhou; Athanasios V. Vasilakos; Lin Wang

arXiv:2404.16501·cs.CV·April 26, 2024

360SFUDA++: Towards Source-free UDA for Panoramic Segmentation by Learning Reliable Category Prototypes

Xu Zheng, Pengyuan Zhou, Athanasios V. Vasilakos, Lin Wang

PDF

Open Access

TL;DR

This paper introduces 360SFUDA++, a novel method for source-free unsupervised domain adaptation in panoramic segmentation, effectively addressing challenges of view mismatch, style differences, and image distortion through innovative projection and prototype adaptation techniques.

Contribution

The paper proposes a new framework combining tangent projection, reliable prototype adaptation, and cross-projection attention to improve panoramic segmentation without source data.

Findings

01

Achieves significant performance improvements over prior SFUDA methods.

02

Effectively handles domain gaps caused by FoV and style discrepancies.

03

Demonstrates robustness across outdoor and indoor panoramic benchmarks.

Abstract

In this paper, we address the challenging source-free unsupervised domain adaptation (SFUDA) for pinhole-to-panoramic semantic segmentation, given only a pinhole image pre-trained model (i.e., source) and unlabeled panoramic images (i.e., target). Tackling this problem is non-trivial due to three critical challenges: 1) semantic mismatches from the distinct Field-of-View (FoV) between domains, 2) style discrepancies inherent in the UDA problem, and 3) inevitable distortion of the panoramic images. To tackle these problems, we propose 360SFUDA++ that effectively extracts knowledge from the source pinhole model with only unlabeled panoramic images and transfers the reliable knowledge to the target panoramic domain. Specifically, we first utilize Tangent Projection (TP) as it has less distortion and meanwhile slits the equirectangular projection (ERP) to patches with fixed FoV projection…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Human Pose and Action Recognition · Image Processing and 3D Reconstruction