GeoMVD: Geometry-Enhanced Multi-View Generation Model Based on Geometric Information Extraction
Jiaqi Wu, Yaosen Chen, Shuyuan Zhu

TL;DR
GeoMVD introduces a geometry-guided multi-view diffusion model that extracts and leverages geometric information to generate consistent, high-quality multi-view images for applications like 3D reconstruction and virtual reality.
Contribution
The paper presents a novel geometry extraction module, a decoupled attention mechanism, and an adaptive intensity adjustment strategy to improve multi-view image generation quality and consistency.
Findings
Enhanced cross-view consistency in generated images.
Improved detail preservation and image quality.
Effective geometric information utilization in multi-view synthesis.
Abstract
Multi-view image generation holds significant application value in computer vision, particularly in domains like 3D reconstruction, virtual reality, and augmented reality. Most existing methods, which rely on extending single images, face notable computational challenges in maintaining cross-view consistency and generating high-resolution outputs. To address these issues, we propose the Geometry-guided Multi-View Diffusion Model, which incorporates mechanisms for extracting multi-view geometric information and adjusting the intensity of geometric features to generate images that are both consistent across views and rich in detail. Specifically, we design a multi-view geometry information extraction module that leverages depth maps, normal maps, and foreground segmentation masks to construct a shared geometric structure, ensuring shape and structural consistency across different views.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques
