TL;DR
AlbumFill is a training-free framework that retrieves identity-consistent references from personal albums to improve personalized image completion, addressing limitations of existing methods.
Contribution
It introduces a novel retrieval-guided approach for personalized image completion using vision-language models and a new dataset of 54K samples.
Findings
Retrieval-based methods outperform generic inpainting in maintaining identity.
The framework effectively retrieves identity-consistent references from personal albums.
Experiments highlight the importance of identity-aware retrieval for personalized completion.
Abstract
Personalized image completion aims to restore occluded regions in personal photos while preserving identity and appearance. Existing methods either rely on generic inpainting models that often fail to maintain identity consistency, or assume that suitable reference images are explicitly provided. In practice, suitable references are often not explicitly provided, requiring the system to search for identity-consistent images within personal photo collections. We present AlbumFill, a training-free framework that retrieves identity-consistent references from personal albums for personalized completion. Given an occluded image and a personal album, a vision-language model infers missing semantic cues to guide composed image retrieval, and the retrieved references are used by reference-based completion models. To facilitate this task, we introduce a dataset containing 54K human-centric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
