Toon3D: Seeing Cartoons from New Perspectives
Ethan Weber, Riley Peterlinz, Rohan Mathur, Frederik Warburg, Alexei, A. Efros, Angjoo Kanazawa

TL;DR
Toon3D introduces a method to reconstruct 3D structures from cartoon and anime images by deforming images to achieve geometric consistency, enabling novel-view synthesis despite the lack of explicit 3D cues.
Contribution
The paper presents Toon3D, a novel approach that deforms cartoon images to recover consistent 3D structure, addressing the limitations of traditional SfM methods on creative media.
Findings
Outperforms classical SfM in geometric consistency.
Enables reliable camera pose estimation from inconsistent images.
Provides a dataset with annotated multi-view cartoon imagery.
Abstract
We recover the underlying 3D structure from images of cartoons and anime depicting the same scene. This is an interesting problem domain because images in creative media are often depicted without explicit geometric consistency for storytelling and creative expression-they are only 3D in a qualitative sense. While humans can easily perceive the underlying 3D scene from these images, existing Structure-from-Motion (SfM) methods that assume 3D consistency fail catastrophically. We present Toon3D for reconstructing geometrically inconsistent images. Our key insight is to deform the input images while recovering camera poses and scene geometry, effectively explaining away geometrical inconsistencies to achieve consistency. This process is guided by the structure inferred from monocular depth predictions. We curate a dataset with multi-view imagery from cartoons and anime that we annotate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation
MethodsFocus
