ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

Renqiu Xia; Bo Zhang; Hancheng Ye; Xiangchao Yan; Qi Liu; Hongbin Zhou; Zijun Chen; Peng Ye; Min Dou; Botian Shi; Junchi Yan; Yu Qiao

arXiv:2402.12185 · cs.CV · April 29, 2025 · 5 cites

TL;DR

This paper introduces ChartX, a comprehensive benchmark for multi-modal models in chart reasoning, and ChartVLM, a new model that outperforms existing models in interpreting and reasoning with complex visual charts.

Contribution

The paper presents a new multi-modal evaluation set for charts and a novel model, ChartVLM, that excels in chart reasoning tasks, surpassing existing models and approaching GPT-4V performance.

Findings

01. ChartVLM outperforms other models on the ChartX benchmark.

02. ChartX covers 18 chart types and 7 chart tasks, providing a comprehensive evaluation.

03. ChartVLM achieves results comparable to GPT-4V.

Abstract

Recently, many versatile Multi-modal Large Language Models (MLLMs) have emerged continuously. However, their capacity to query information depicted in visual charts and engage in reasoning based on the queried contents remains under-explored. In this paper, to comprehensively and rigorously benchmark the ability of the off-the-shelf MLLMs in the chart domain, we construct ChartX, a multi-modal evaluation set covering 18 chart types, 7 chart tasks, 22 disciplinary topics, and high-quality chart data. Besides, we develop ChartVLM to offer a new perspective on handling multi-modal tasks that strongly depend on interpretable patterns, such as reasoning tasks in the field of charts or geometric images. We evaluate the chart-related ability of mainstream MLLMs and our ChartVLM on the proposed ChartX evaluation set. Extensive experiments demonstrate that ChartVLM surpasses both versatile and…

Code & Models

Datasets

InternScience/ChartX
dataset · 270 dl

Taxonomy

Topics: Semantic Web and Ontologies · Formal Methods in Verification · Natural Language Processing Techniques

Methods: Sparse Evolutionary Training