InsHuman: Towards Natural and Identity-Preserving Human Insertion
Jie Li, Shulian Zhang, Yangyang Gao, Wenbo Li, Yulun Zhang, Yong Guo, Jian Chen

TL;DR
InsHuman introduces a novel method for natural human insertion into images that preserves identity and adapts to backgrounds, addressing limitations of previous models and datasets.
Contribution
The paper proposes HBAF, FFIP, and BDP strategies to improve human insertion quality, identity preservation, and dataset realism in image editing.
Findings
InsHuman outperforms existing models in generating realistic, identity-preserving human insertions.
The proposed methods ensure coherent pose, count, and appearance alignment with backgrounds.
Experiments validate the effectiveness of InsHuman in producing high-quality images with preserved identities.
Abstract
Human insertion aims to naturally place specific individuals into a target background. Although existing image editing models may have such ability, they often produce failure cases, including inappropriate human pose in new background, inconsistent number of people, and modified facial identity. Moreover, publicly available human datasets often lack full-body portraits and realistic physical interaction between humans and their background. To address these challenges, we propose InsHuman for natural and identity-preserving human insertion. Specifically, we propose Human-Background Adaptive Fusion (HBAF), which detects foreground humans to obtain a binary mask and applies region-aware weighting to align the human regions between predicted and ground-truth latents, ensuring the person's pose, count, and overall appearance are coherently adapted to the target background.We further propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
