Are Video Generation Models Geographically Fair? An Attraction-Centric Evaluation of Global Visual Knowledge
Xiao Liu, Jiawei Zhang

TL;DR
This paper introduces a new framework and benchmark to evaluate the geographic fairness of text-to-video models, revealing that current models like Sora 2 encode geographically grounded visual knowledge more evenly than previously assumed.
Contribution
The paper presents Geo-Attraction Landmark Probing (GAP), a systematic evaluation framework, and GEOATTRACTION-500, a benchmark for assessing geographic fairness in video generation models.
Findings
Models show relatively uniform geographic visual knowledge across regions.
Current models exhibit weak dependence on attraction popularity.
Results suggest models encode global visual knowledge more evenly than expected.
Abstract
Recent advances in text-to-video generation have produced visually compelling results, yet it remains unclear whether these models encode geographically equitable visual knowledge. In this work, we investigate the geo-equity and geographically grounded visual knowledge of text-to-video models through an attraction-centric evaluation. We introduce Geo-Attraction Landmark Probing (GAP), a systematic framework for assessing how faithfully models synthesize tourist attractions from diverse regions, and construct GEOATTRACTION-500, a benchmark of 500 globally distributed attractions spanning varied regions and popularity levels. GAP integrates complementary metrics that disentangle overall video quality from attraction-specific knowledge, including global structural alignment, fine-grained keypoint-based alignment, and vision-language model judgments, all validated against human evaluation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Diverse Aspects of Tourism Research · Geographic Information Systems Studies
