MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing
Chenjie Cao, Chaohui Yu, Fan Wang, Xiangyang Xue, Yanwei Fu

TL;DR
MVInpainter introduces a multi-view 2D inpainting approach for 3D editing that enhances generalization to real-world scenes and eliminates the need for explicit camera pose information, enabling diverse scene editing tasks.
Contribution
It reformulates 3D editing as multi-view inpainting, leveraging reference guidance and video priors to ensure cross-view consistency without pose dependence.
Findings
Effective on object-centric and forward-facing datasets.
Enables multi-view object removal, synthesis, insertion, and replacement.
Outperforms existing methods in in-the-wild scenarios.
Abstract
Novel View Synthesis (NVS) and 3D generation have recently achieved prominent improvements. However, these works mainly focus on confined categories or synthetic 3D assets, which are discouraged from generalizing to challenging in-the-wild scenes and fail to be employed with 2D synthesis directly. Moreover, these methods heavily depended on camera poses, limiting their real-world applications. To overcome these issues, we propose MVInpainter, re-formulating the 3D editing as a multi-view 2D inpainting task. Specifically, MVInpainter partially inpaints multi-view images with the reference guidance rather than intractably generating an entirely novel view from scratch, which largely simplifies the difficulty of in-the-wild NVS and leverages unmasked clues instead of explicit pose conditions. To ensure cross-view consistency, MVInpainter is enhanced by video priors from motion components…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques
MethodsSoftmax · Attention Is All You Need · Focus · Inpainting
