DP-Adapter: Dual-Pathway Adapter for Boosting Fidelity and Text   Consistency in Customizable Human Image Generation

Ye Wang; Xuping Xie; Lanjun Wang; Zili Yi; Rui Ma

arXiv:2502.13999·cs.GR·February 21, 2025

DP-Adapter: Dual-Pathway Adapter for Boosting Fidelity and Text Consistency in Customizable Human Image Generation

Ye Wang, Xuping Xie, Lanjun Wang, Zili Yi, Rui Ma

PDF

Open Access

TL;DR

The paper introduces DP-Adapter, a dual-pathway approach that enhances fidelity and textual consistency in personalized human image generation by decoupling visual and textual influences and integrating them through a feature blending module.

Contribution

It proposes a novel dual-pathway adapter with specialized modules for identity preservation and textual consistency, improving personalized human image synthesis.

Findings

01

Improved identity preservation in generated images.

02

Enhanced textual consistency with reduced visual interference.

03

Supports diverse applications like age editing and expression modification.

Abstract

With the growing popularity of personalized human content creation and sharing, there is a rising demand for advanced techniques in customized human image generation. However, current methods struggle to simultaneously maintain the fidelity of human identity and ensure the consistency of textual prompts, often resulting in suboptimal outcomes. This shortcoming is primarily due to the lack of effective constraints during the simultaneous integration of visual and textual prompts, leading to unhealthy mutual interference that compromises the full expression of both types of input. Building on prior research that suggests visual and textual conditions influence different regions of an image in distinct ways, we introduce a novel Dual-Pathway Adapter (DP-Adapter) to enhance both high-fidelity identity preservation and textual consistency in personalized human image generation. Our approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Generative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques