Preserve Anything: Controllable Image Synthesis with Object Preservation
Prasen Kumar Sharma, Neeraj Matiyali, Siddharth Srivastava, Gaurav Sharma

TL;DR
This paper presents 'Preserve Anything', a new controllable image synthesis method that enhances object preservation, semantic consistency, and user control in text-to-image generation, achieving state-of-the-art results and a new benchmark dataset.
Contribution
It introduces a novel framework with object preservation and background guidance modules, along with a comprehensive benchmark dataset for evaluating image synthesis quality.
Findings
Achieves state-of-the-art FID 15.26 and CLIP-S 32.85 scores.
Significantly improves prompt alignment and photorealism in user studies.
Provides a new dataset with 240K natural and 18K synthetic images for evaluation.
Abstract
We introduce \textit{Preserve Anything}, a novel method for controlled image synthesis that addresses key limitations in object preservation and semantic consistency in text-to-image (T2I) generation. Existing approaches often fail (i) to preserve multiple objects with fidelity, (ii) maintain semantic alignment with prompts, or (iii) provide explicit control over scene composition. To overcome these challenges, the proposed method employs an N-channel ControlNet that integrates (i) object preservation with size and placement agnosticism, color and detail retention, and artifact elimination, (ii) high-resolution, semantically consistent backgrounds with accurate shadows, lighting, and prompt adherence, and (iii) explicit user control over background layouts and lighting conditions. Key components of our framework include object preservation and background guidance modules, enforcing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis
