OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
Cong Wei, Zheyang Xiong, Weiming Ren, Xinrun Du, Ge Zhang, Wenhu Chen

TL;DR
OmniEdit is a versatile image editing model trained on diverse specialist supervision, utilizing improved data quality and a novel architecture to handle multiple tasks and aspect ratios, outperforming existing methods.
Contribution
The paper introduces OmniEdit, a generalist image editing model trained with specialist supervision, importance sampling for data quality, and a new architecture called EditNet for improved success.
Findings
Outperforms existing image editing models in automatic and human evaluations.
Handles seven different editing tasks across various aspect ratios.
Demonstrates significant improvements in editing success rate.
Abstract
Instruction-guided image editing methods have demonstrated significant potential by training diffusion models on automatically synthesized or manually annotated image editing pairs. However, these methods remain far from practical, real-life applications. We identify three primary challenges contributing to this gap. Firstly, existing models have limited editing skills due to the biased synthesis process. Secondly, these methods are trained with datasets with a high volume of noise and artifacts. This is due to the application of simple filtering methods like CLIP-score. Thirdly, all these datasets are restricted to a single low resolution and fixed aspect ratio, limiting the versatility to handle real-world use cases. In this paper, we present \omniedit, which is an omnipotent editor to handle seven different image editing tasks with any aspect ratio seamlessly. Our contribution is in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · AI in cancer detection · Image Retrieval and Classification Techniques
MethodsDiffusion · Sparse Evolutionary Training
