Syllables to Scenes: Literary-Guided Free-Viewpoint 3D Scene Synthesis from Japanese Haiku
Chunan Yu, Yidong Han, Chaotao Ding, Ying Zang, Lanyun Zhu, Xinhao, Chen, Zejian Li, Renjun Xu, Tianrun Chen

TL;DR
This paper presents HaikuVerse, a novel framework that translates Japanese Haiku poetry into detailed 3D scenes, combining literary analysis with advanced generative techniques to preserve emotional and visual fidelity in immersive environments.
Contribution
It introduces a hierarchical literary-criticism grounded parsing method and a multi-stage synthesis pipeline for accurate poetic-to-3D scene generation, advancing beyond existing text-to-3D approaches.
Findings
Outperforms traditional methods in literary fidelity
Produces higher quality and more coherent 3D scenes
Successfully captures emotional and visual nuances of poetry
Abstract
In the era of the metaverse, where immersive technologies redefine human experiences, translating abstract literary concepts into navigable 3D environments presents a fundamental challenge in preserving semantic and emotional fidelity. This research introduces HaikuVerse, a novel framework for transforming poetic abstraction into spatial representation, with Japanese Haiku serving as an ideal test case due to its sophisticated encapsulation of profound emotions and imagery within minimal text. While existing text-to-3D methods struggle with nuanced interpretations, we present a literary-guided approach that synergizes traditional poetry analysis with advanced generative technologies. Our framework centers on two key innovations: (1) Hierarchical Literary-Criticism Theory Grounded Parsing (H-LCTGP), which captures both explicit imagery and implicit emotional resonance through structured…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Human Motion and Animation · 3D Modeling in Geospatial Applications
MethodsDiffusion
