POLYCHARTQA: Benchmarking Large Vision-Language Models with Multilingual Chart Question Answering

Yichen Xu; Liangyu Chen; Liang Zhang; Jianzhe Ma; Wenxuan Wang; Qin Jin

arXiv:2507.11939 · cs.CL · January 9, 2026

TL;DR

PolyChartQA is a comprehensive multilingual benchmark for chart question answering, addressing the lack of non-English chart understanding datasets and evaluating current models' performance across diverse languages.

Contribution

We introduce PolyChartQA, the first large-scale multilingual chart QA benchmark, and demonstrate its utility in evaluating and improving multilingual vision-language models.

Findings

01

State-of-the-art LVLMs show a significant performance gap in chart understanding between English and other languages, particularly low-resource ones.

02

Fine-tuning on PolyChartQA-Train improves multilingual chart comprehension.

03

The benchmark supports the development of more inclusive vision-language models.

Abstract

Charts are a universally adopted medium for data communication, yet existing chart understanding benchmarks are overwhelmingly English-centric, limiting their accessibility and relevance to global audiences. To address this limitation, we introduce PolyChartQA, the first large-scale multilingual benchmark for chart question answering, comprising 22,606 charts and 26,151 QA pairs across 10 diverse languages. PolyChartQA is constructed through a scalable pipeline that enables efficient multilingual chart generation via data translation and code reuse, supported by LLM-based translation and rigorous quality control. We systematically evaluate multilingual chart understanding with PolyChartQA on state-of-the-art LVLMs and reveal a significant performance gap between English and other languages, particularly low-resource ones. Additionally, we introduce a companion multilingual chart…
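The idea of generating multilingual charts "via data translation and code reuse" can be illustrated with a minimal sketch. It is not the authors' pipeline: the chart spec layout, the field names, the `translate_spec` and `render` helpers, and the toy lookup table standing in for LLM-based translation are all assumptions made for illustration. The point it shows is that only the text fields are translated, while the numeric data and the rendering code stay the same for every language.

```python
import copy

# Hypothetical chart spec: text fields are kept separate from numeric data.
chart_spec = {
    "title": "Annual revenue",
    "x_label": "Year",
    "y_label": "Revenue (USD millions)",
    "categories": ["2021", "2022", "2023"],
    "values": [12.5, 15.0, 18.2],
}

# Assumed set of translatable fields; a real spec would likely have more.
TEXT_FIELDS = ("title", "x_label", "y_label")


def translate_spec(spec, translate):
    """Return a copy of spec with only the text fields translated."""
    out = copy.deepcopy(spec)
    for field in TEXT_FIELDS:
        out[field] = translate(spec[field])
    return out


def render(spec):
    """Language-agnostic rendering stand-in: one function reused for all languages."""
    lines = [spec["title"], f'{spec["x_label"]} | {spec["y_label"]}']
    for category, value in zip(spec["categories"], spec["values"]):
        lines.append(f"{category} | {value}")
    return "\n".join(lines)


# Toy lookup table standing in for an LLM translation call (assumption).
TOY_ES = {
    "Annual revenue": "Ingresos anuales",
    "Year": "Año",
    "Revenue (USD millions)": "Ingresos (millones de USD)",
}


def toy_translate(text):
    """Translate via the toy table, falling back to the source text."""
    return TOY_ES.get(text, text)


spanish_spec = translate_spec(chart_spec, toy_translate)
print(render(spanish_spec))
```

In this sketch, adding a language means supplying another `translate` callable; the spec's numbers and the `render` function are untouched, which is what keeps the approach cheap to scale.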

Taxonomy

Topics: Multimodal Machine Learning Applications · Natural Language Processing Techniques · Topic Modeling