A Progressive Evaluation Framework for Multicultural Analysis of Story Visualization
Janak Kapuriya, Ali Hatami, Paul Buitelaar

TL;DR
This paper introduces a comprehensive framework for evaluating the cultural appropriateness of story visualization models across multiple languages, highlighting biases and proposing new metrics and automated assessment methods.
Contribution
It presents a novel Progressive Multicultural Evaluation Framework with five new metrics and an MLLM-based automated assessment, addressing cultural fidelity in story visualization.
Findings
Models perform better on real-world datasets than animated ones.
Cultural biases are evident, with English and Chinese cultures showing different strengths.
Hindi shows lower performance, indicating cultural bias in models.
Abstract
Recent advancements in text-to-image generative models have improved narrative consistency in story visualization. However, current story visualization models often overlook cultural dimensions, resulting in visuals that lack authenticity and cultural fidelity. In this study, we conduct a comprehensive multicultural analysis of story visualization using current text-to-image models across multilingual settings on two datasets: FlintstonesSV and VIST. To assess cultural dimensions rigorously, we propose a Progressive Multicultural Evaluation Framework and introduce five story visualization metrics, Cultural Appropriateness, Visual Aesthetics, Cohesion, Semantic Consistency, and Object Presence, that are not addressed by existing metrics. We further automate assessment through an MLLM-as-Jury framework that approximates human judgment. Human evaluations show that models generate more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Multimodal Machine Learning Applications · Artificial Intelligence in Games
