Twist and Compute: The Cost of Pose in 3D Generative Diffusion

Kyle Fogarty; Jack Foster; Boqiao Zhang; Jing Yang; Cengiz \"Oztireli

arXiv:2511.08203·cs.CV·November 12, 2025

Twist and Compute: The Cost of Pose in 3D Generative Diffusion

Kyle Fogarty, Jack Foster, Boqiao Zhang, Jing Yang, Cengiz \"Oztireli

PDF

Open Access

TL;DR

This paper investigates the limitations of large-scale image-to-3D generative models, revealing a strong canonical view bias and proposing a lightweight correction method to improve viewpoint generalization.

Contribution

The study identifies a canonical view bias in 3D generative models and introduces a simple CNN-based correction approach to mitigate it without altering the core model.

Findings

01

Model performance degrades with rotated inputs.

02

A lightweight CNN can correct input orientation effectively.

03

The correction restores model performance across viewpoints.

Abstract

Despite their impressive results, large-scale image-to-3D generative models remain opaque in their inductive biases. We identify a significant limitation in image-conditioned 3D generative models: a strong canonical view bias. Through controlled experiments using simple 2D rotations, we show that the state-of-the-art Hunyuan3D 2.0 model can struggle to generalize across viewpoints, with performance degrading under rotated inputs. We show that this failure can be mitigated by a lightweight CNN that detects and corrects input orientation, restoring model performance without modifying the generative backbone. Our findings raise an important open question: Is scale enough, or should we pursue modular, symmetry-aware designs?

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · 3D Shape Modeling and Analysis