Syllables to Scenes: Literary-Guided Free-Viewpoint 3D Scene Synthesis   from Japanese Haiku

Chunan Yu; Yidong Han; Chaotao Ding; Ying Zang; Lanyun Zhu; Xinhao; Chen; Zejian Li; Renjun Xu; Tianrun Chen

arXiv:2502.11586·cs.CV·February 18, 2025

Syllables to Scenes: Literary-Guided Free-Viewpoint 3D Scene Synthesis from Japanese Haiku

Chunan Yu, Yidong Han, Chaotao Ding, Ying Zang, Lanyun Zhu, Xinhao, Chen, Zejian Li, Renjun Xu, Tianrun Chen

PDF

Open Access

TL;DR

This paper presents HaikuVerse, a novel framework that translates Japanese Haiku poetry into detailed 3D scenes, combining literary analysis with advanced generative techniques to preserve emotional and visual fidelity in immersive environments.

Contribution

It introduces a hierarchical literary-criticism grounded parsing method and a multi-stage synthesis pipeline for accurate poetic-to-3D scene generation, advancing beyond existing text-to-3D approaches.

Findings

01

Outperforms traditional methods in literary fidelity

02

Produces higher quality and more coherent 3D scenes

03

Successfully captures emotional and visual nuances of poetry

Abstract

In the era of the metaverse, where immersive technologies redefine human experiences, translating abstract literary concepts into navigable 3D environments presents a fundamental challenge in preserving semantic and emotional fidelity. This research introduces HaikuVerse, a novel framework for transforming poetic abstraction into spatial representation, with Japanese Haiku serving as an ideal test case due to its sophisticated encapsulation of profound emotions and imagery within minimal text. While existing text-to-3D methods struggle with nuanced interpretations, we present a literary-guided approach that synergizes traditional poetry analysis with advanced generative technologies. Our framework centers on two key innovations: (1) Hierarchical Literary-Criticism Theory Grounded Parsing (H-LCTGP), which captures both explicit imagery and implicit emotional resonance through structured…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · Human Motion and Animation · 3D Modeling in Geospatial Applications

MethodsDiffusion