Crossing Borders: A Multimodal Challenge for Indian Poetry Translation and Image Generation
Sofia Jamil, Kotla Sai Charan, Sriparna Saha, Koustava Goswami, Joseph K J

TL;DR
This paper introduces TAI, a framework combining translation and image generation to make Indian poetry accessible globally, using LLMs and diffusion models, and provides a new dataset for low-resource Indian languages.
Contribution
The paper presents a novel multimodal framework for translating and visually representing Indian poetry, including a new dataset for low-resource languages and a semantic graph-based image generation approach.
Findings
TAI Diffusion outperforms baselines in poem image generation
The Odds Ratio Preference Alignment Algorithm improves translation accuracy
The Morphologically Rich Indian Language Poems MorphoVerse Dataset supports low-resource languages
Abstract
Indian poetry, known for its linguistic complexity and deep cultural resonance, has a rich and varied heritage spanning thousands of years. However, its layered meanings, cultural allusions, and sophisticated grammatical constructions often pose challenges for comprehension, especially for non-native speakers or readers unfamiliar with its context and language. Despite its cultural significance, existing works on poetry have largely overlooked Indian language poems. In this paper, we propose the Translation and Image Generation (TAI) framework, leveraging Large Language Models (LLMs) and Latent Diffusion Models through appropriate prompt tuning. Our framework supports the United Nations Sustainable Development Goals of Quality Education (SDG 4) and Reduced Inequalities (SDG 10) by enhancing the accessibility of culturally rich Indian-language poetry to a global audience. It includes (1)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsLanguage, Metaphor, and Cognition · Multimodal Machine Learning Applications · Artificial Intelligence in Games
