LSAP: Rethinking Inversion Fidelity, Perception and Editability in GAN Latent Space
Xuekun Zhao, Pu Cao, Xiaoya Yang, Mingjian Zhang, Lu Yang, Qing Song

TL;DR
This paper introduces LSAP, a new inversion paradigm that aligns latent codes with the synthetic distribution to improve both reconstruction fidelity and perceptual/editability in GAN image inversion.
Contribution
The paper proposes the LSAP framework and the NSCD metric, unifying and optimizing inversion to enhance perception and editability alongside fidelity.
Findings
NSCD effectively captures perceptual and editable qualities.
LSAP achieves state-of-the-art performance in image inversion.
Unified framework benefits both encoder and optimization-based methods.
Abstract
As research on image inversion advances, the process is generally divided into two stages. The first step is Image Embedding, involves using an encoder or optimization procedure to embed an image and obtain its corresponding latent code. The second stage, referred to as Result Refinement, further improves the inversion and editing outcomes. Although this refinement stage substantially enhances reconstruction fidelity, perception and editability remain largely unchanged and are highly dependent on the latent codes derived from the first stage. Therefore, a key challenge lies in obtaining latent codes that preserve reconstruction fidelity while simultaneously improving perception and editability. In this work, we first reveal that these two properties are closely related to the degree of alignment (or disalignment) between the inverted latent codes and the synthetic distribution. Based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis
