Diffusion Features to Bridge Domain Gap for Semantic Segmentation

Yuxiang Ji; Boyong He; Chenyuan Qu; Zhuoyue Tan; Chuan Qin; Liaoni Wu

arXiv:2406.00777·cs.CV·November 22, 2024

Diffusion Features to Bridge Domain Gap for Semantic Segmentation

Yuxiang Ji, Boyong He, Chenyuan Qu, Zhuoyue Tan, Chuan Qin, Liaoni Wu

PDF

Open Access 1 Repo

TL;DR

This paper introduces DIFF, a novel diffusion feature fusion method that leverages pre-trained diffusion models to improve cross-domain semantic segmentation, achieving state-of-the-art results by effectively capturing universal features.

Contribution

The paper presents a new framework that utilizes diffusion models' implicit knowledge and sampling techniques to enhance domain generalization in semantic segmentation.

Findings

01

Outperforms previous methods in domain generalization tasks.

02

Achieves state-of-the-art benchmark results.

03

Effectively captures universal features across domains.

Abstract

Pre-trained diffusion models have demonstrated remarkable proficiency in synthesizing images across a wide range of scenarios with customizable prompts, indicating their effective capacity to capture universal features. Motivated by this, our study delves into the utilization of the implicit knowledge embedded within diffusion models to address challenges in cross-domain semantic segmentation. This paper investigates the approach that leverages the sampling and fusion techniques to harness the features of diffusion models efficiently. We propose DIffusion Feature Fusion (DIFF) as a backbone use for extracting and integrating effective semantic representations through the diffusion process. By leveraging the strength of text-to-image generation capability, we introduce a new training framework designed to implicitly learn posterior knowledge from it. Through rigorous evaluation in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Yux1angJi/DIFF
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsDiffusion