EvoChart: A Benchmark and a Self-Training Approach Towards Real-World   Chart Understanding

Muye Huang; Han Lai; Xinyu Zhang; Wenjun Wu; Jie Ma; Lingling Zhang,; Jun Liu

arXiv:2409.01577·cs.CV·January 14, 2025

EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart Understanding

Muye Huang, Han Lai, Xinyu Zhang, Wenjun Wu, Jie Ma, Lingling Zhang,, Jun Liu

PDF

Open Access 1 Repo 1 Models 1 Datasets 1 Video

TL;DR

EvoChart introduces a self-training data synthesis method and a comprehensive benchmark to improve and evaluate real-world chart understanding capabilities of visual language models, addressing data scarcity and evaluation gaps.

Contribution

The paper presents EvoChart, a novel self-training approach for synthetic chart data generation and a new benchmark, EvoChart-QA, for assessing model performance in real-world chart comprehension.

Findings

01

EvoChart significantly improves VLMs' accuracy on real-world chart tasks.

02

GPT-4o achieves only 49.8% accuracy on the benchmark.

03

Open-source VLMs' performance is boosted to 54.2% with EvoChart.

Abstract

Chart understanding enables automated data analysis for humans, which requires models to achieve highly accurate visual comprehension. While existing Visual Language Models (VLMs) have shown progress in chart understanding, the lack of high-quality training data and comprehensive evaluation benchmarks hinders VLM chart comprehension. In this paper, we introduce EvoChart, a novel self-training method for generating synthetic chart data to enhance VLMs' capabilities in real-world chart comprehension. We also propose EvoChart-QA, a noval benchmark for measuring models' chart comprehension abilities in real-world scenarios. Specifically, EvoChart is a unique self-training data synthesis approach that simultaneously produces high-quality training corpus and a high-performance chart understanding model. EvoChart-QA consists of 650 distinct real-world charts collected from 140 different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MuyeHuang/EvoChart
noneOfficial

Models

🤗
MuyeHuang/EvoChart
model· 2 dl· ♡ 1
2 dl♡ 1

Datasets

MuyeHuang/EvoChart-QA-Benchmark
dataset· 30 dl
30 dl

Videos

EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart Understanding· underline

Taxonomy

TopicsTime Series Analysis and Forecasting

MethodsFocus