AlignCVC: Aligning Cross-View Consistency for Single-Image-to-3D Generation
Xinyue Liang, Zhiyuan Ma, Lingchen Sun, Yanjun Guo, Lei Zhang

TL;DR
AlignCVC introduces a distribution alignment framework that improves cross-view consistency in single-image-to-3D generation, enhancing quality and speed by focusing on distribution matching rather than strict regression.
Contribution
The paper proposes a novel distribution alignment approach for better cross-view consistency, enabling faster inference and seamless integration with existing models.
Findings
Significantly improves 3D reconstruction quality.
Reduces inference steps to as few as 4.
Demonstrates effectiveness across various models.
Abstract
Single-image-to-3D models typically follow a sequential generation and reconstruction workflow. However, intermediate multi-view images synthesized by pre-trained generation models often lack cross-view consistency (CVC), significantly degrading 3D reconstruction performance. While recent methods attempt to refine CVC by feeding reconstruction results back into the multi-view generator, these approaches struggle with noisy and unstable reconstruction outputs that limit effective CVC improvement. We introduce AlignCVC, a novel framework that fundamentally re-frames single-image-to-3D generation through distribution alignment rather than relying on strict regression losses. Our key insight is to align both generated and reconstructed multi-view distributions toward the ground-truth multi-view distribution, establishing a principled foundation for improved CVC. Observing that generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques
