RusCode: Russian Cultural Code Benchmark for Text-to-Image Generation
Viacheslav Vasilev, Julia Agafonova, Nikolai Gerasimenko, Alexander, Kapitanov, Polina Mikhailova, Evelina Mironova, Denis Dimitrov

TL;DR
This paper introduces RusCode, a benchmark dataset designed to evaluate how well text-to-image models capture Russian cultural elements, addressing cultural bias in AI-generated imagery.
Contribution
The paper presents a new benchmark dataset with 1250 prompts representing Russian culture, enabling assessment of cultural awareness in text-to-image models.
Findings
Human evaluation shows varying accuracy of models in representing Russian culture.
The benchmark reveals specific cultural biases and gaps in current models.
Results highlight the need for culturally-aware training data.
Abstract
Text-to-image generation models have gained popularity among users around the world. However, many of these models exhibit a strong bias toward English-speaking cultures, ignoring or misrepresenting the unique characteristics of other language groups, countries, and nationalities. The lack of cultural awareness can reduce the generation quality and lead to undesirable consequences such as unintentional insult, and the spread of prejudice. In contrast to the field of natural language processing, cultural awareness in computer vision has not been explored as extensively. In this paper, we strive to reduce this gap. We propose a RusCode benchmark for evaluating the quality of text-to-image generation containing elements of the Russian cultural code. To do this, we form a list of 19 categories that best represent the features of Russian visual culture. Our final dataset consists of 1250…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Natural Language Processing Techniques · Digital Humanities and Scholarship
