PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation
Liyao Jiang, Negar Hassanpour, Mohammad Salameh, Mohammadreza Samadi,, Jiao He, Fengyu Sun, Di Niu

TL;DR
PixelMan is a novel, efficient, inversion-free method for consistent object editing in images using diffusion models, achieving high-quality results with fewer inference steps compared to existing approaches.
Contribution
It introduces a training-free, pixel manipulation-based approach that ensures image consistency and outperforms state-of-the-art methods in fewer inference steps.
Findings
Outperforms existing methods in fewer steps
Maintains high image consistency during editing
Requires no training or inversion process
Abstract
Recent research explores the potential of Diffusion Models (DMs) for consistent object editing, which aims to modify object position, size, and composition, etc., while preserving the consistency of objects and background without changing their texture and attributes. Current inference-time methods often rely on DDIM inversion, which inherently compromises efficiency and the achievable consistency of edited images. Recent methods also utilize energy guidance which iteratively updates the predicted noise and can drive the latents away from the original image, resulting in distortions. In this paper, we propose PixelMan, an inversion-free and training-free method for achieving consistent object editing via Pixel Manipulation and generation, where we directly create a duplicate copy of the source object at target location in the pixel space, and introduce an efficient sampling approach to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · Modular Robots and Swarm Intelligence
MethodsDiffusion
