Benchmarking Multimodal Models for Ukrainian Language Understanding Across Academic and Cultural Domains
Yurii Paniv, Artur Kiulian, Dmytro Chaplynskyi, Mykola Khandoga, Anton, Polishko, Tetiana Bas, Guillermo Gabrielli

TL;DR
This paper introduces ZNO-Vision, a comprehensive multimodal benchmark for Ukrainian language understanding across academic and cultural domains, addressing the lack of evaluation tools for low-resource languages.
Contribution
It presents the first Ukrainian multimodal benchmark derived from university exams and evaluates existing models, highlighting performance gaps and cultural knowledge challenges.
Findings
Few models outperform baseline on the benchmark
Performance drops significantly when translating English benchmarks to Ukrainian
Models show limited understanding of Ukrainian cultural content
Abstract
While the evaluation of multimodal English-centric models is an active area of research with numerous benchmarks, there is a profound lack of benchmarks or evaluation suites for low- and mid-resource languages. We introduce ZNO-Vision, a comprehensive multimodal Ukrainian-centric benchmark derived from standardized university entrance examination (ZNO). The benchmark consists of over 4,300 expert-crafted questions spanning 12 academic disciplines, including mathematics, physics, chemistry, and humanities. We evaluated the performance of both open-source models and API providers, finding that only a handful of models performed above baseline. Alongside the new benchmark, we performed the first evaluation study of multimodal text generation for the Ukrainian language: we measured caption generation quality on the Multi30K-UK dataset, translated the VQA benchmark into Ukrainian, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
Topicslinguistics and terminology studies · Language, Communication, and Linguistic Studies
