AlignCVC: Aligning Cross-View Consistency for Single-Image-to-3D Generation

Xinyue Liang; Zhiyuan Ma; Lingchen Sun; Yanjun Guo; Lei Zhang

arXiv:2506.23150·cs.CV·November 26, 2025

AlignCVC: Aligning Cross-View Consistency for Single-Image-to-3D Generation

Xinyue Liang, Zhiyuan Ma, Lingchen Sun, Yanjun Guo, Lei Zhang

PDF

Open Access

TL;DR

AlignCVC introduces a distribution alignment framework that improves cross-view consistency in single-image-to-3D generation, enhancing quality and speed by focusing on distribution matching rather than strict regression.

Contribution

The paper proposes a novel distribution alignment approach for better cross-view consistency, enabling faster inference and seamless integration with existing models.

Findings

01

Significantly improves 3D reconstruction quality.

02

Reduces inference steps to as few as 4.

03

Demonstrates effectiveness across various models.

Abstract

Single-image-to-3D models typically follow a sequential generation and reconstruction workflow. However, intermediate multi-view images synthesized by pre-trained generation models often lack cross-view consistency (CVC), significantly degrading 3D reconstruction performance. While recent methods attempt to refine CVC by feeding reconstruction results back into the multi-view generator, these approaches struggle with noisy and unstable reconstruction outputs that limit effective CVC improvement. We introduce AlignCVC, a novel framework that fundamentally re-frames single-image-to-3D generation through distribution alignment rather than relying on strict regression losses. Our key insight is to align both generated and reconstructed multi-view distributions toward the ground-truth multi-view distribution, establishing a principled foundation for improved CVC. Observing that generated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques