HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images

Yichen Liu; Donghao Zhou; Jie Wang; Xin Gao; Guisheng Liu; Jiatong Li; Quanwei Zhang; Qiang Lyu; Lanqing Guo; Shilei Wen; Weiqiang Wang; Pheng-Ann Heng

arXiv:2603.02210·cs.CV·April 20, 2026

HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images

Yichen Liu, Donghao Zhou, Jie Wang, Xin Gao, Guisheng Liu, Jiatong Li, Quanwei Zhang, Qiang Lyu, Lanqing Guo, Shilei Wen, Weiqiang Wang, Pheng-Ann Heng

PDF

1 Repo 1 Datasets

TL;DR

HiFi-Inpaint is a new framework for high-fidelity reference-based inpainting that preserves product details in human-product images, utilizing novel attention and loss mechanisms, and trained on a large curated dataset.

Contribution

It introduces Shared Enhancement Attention and Detail-Aware Loss for improved detail preservation, along with a new dataset HP-Image-40K for training.

Findings

01

Achieves state-of-the-art detail preservation in human-product image inpainting.

02

Outperforms existing methods on the HP-Image-40K dataset.

03

Demonstrates effective guidance of product details through proposed mechanisms.

Abstract

Human-product images, which showcase the integration of humans and products, play a vital role in advertising, e-commerce, and digital marketing. The essential challenge of generating such images lies in ensuring the high-fidelity preservation of product details. Among existing paradigms, reference-based inpainting offers a targeted solution by leveraging product reference images to guide the inpainting process. However, limitations remain in three key aspects: the lack of diverse large-scale training data, the struggle of current models to focus on product detail preservation, and the inability of coarse supervision for achieving precise guidance. To address these issues, we propose HiFi-Inpaint, a novel high-fidelity reference-based inpainting framework tailored for generating human-product images. HiFi-Inpaint introduces Shared Enhancement Attention (SEA) to refine fine-grained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

correr-zhou/HiFi-Inpaint
github

Datasets

donghao-zhou/HP-Image-40K
dataset· 1.4k dl
1.4k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.