MTADiffusion: Mask Text Alignment Diffusion Model for Object Inpainting
Jun Huang, Ting Liu, Yihang Wu, Xiaochao Qu, Luoqi Liu, Xiaolin Hu

TL;DR
MTADiffusion is a novel diffusion-based object inpainting model that uses a new dataset, multi-task training, and style consistency techniques to improve semantic accuracy, structural stability, and style coherence in image inpainting.
Contribution
The paper introduces MTADiffusion, a diffusion model with a new dataset, MTAPipeline, and multi-task training for improved object inpainting quality.
Findings
Achieves state-of-the-art results on BrushBench and EditBench.
Effectively maintains semantic alignment and structural integrity.
Demonstrates superior style consistency in inpainted images.
Abstract
Advancements in generative models have enabled image inpainting models to generate content within specific regions of an image based on provided prompts and masks. However, existing inpainting methods often suffer from problems such as semantic misalignment, structural distortion, and style inconsistency. In this work, we present MTADiffusion, a Mask-Text Alignment diffusion model designed for object inpainting. To enhance the semantic capabilities of the inpainting model, we introduce MTAPipeline, an automatic solution for annotating masks with detailed descriptions. Based on the MTAPipeline, we construct a new MTADataset comprising 5 million images and 25 million mask-text pairs. Furthermore, we propose a multi-task training strategy that integrates both inpainting and edge prediction tasks to improve structural stability. To promote style consistency, we present a novel inpainting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Aesthetic Perception and Analysis · Computer Graphics and Visualization Techniques
MethodsInpainting · Diffusion
