InterChart: Benchmarking Visual Reasoning Across Decomposed and Distributed Chart Information

Anirudh Iyengar Kaniyar Narayana Iyengar; Srija Mukhopadhyay; Adnan Qidwai; Shubhankar Singh; Dan Roth; Vivek Gupta

arXiv:2508.07630·cs.CL·May 4, 2026

InterChart: Benchmarking Visual Reasoning Across Decomposed and Distributed Chart Information

Anirudh Iyengar Kaniyar Narayana Iyengar, Srija Mukhopadhyay, Adnan Qidwai, Shubhankar Singh, Dan Roth, Vivek Gupta

PDF

1 Datasets

TL;DR

InterChart is a comprehensive benchmark designed to evaluate vision-language models' ability to reason across multiple related charts, highlighting their limitations in complex, multi-visual reasoning tasks.

Contribution

The paper introduces InterChart, a new benchmark with diverse, multi-tiered chart reasoning tasks to systematically assess and challenge current vision-language models.

Findings

01

Models show significant accuracy drops with increased chart complexity.

02

Decomposing multi-entity charts into simpler units improves model performance.

03

InterChart exposes systematic limitations in current multimodal reasoning models.

Abstract

We introduce InterChart, a diagnostic benchmark that evaluates how well vision-language models (VLMs) reason across multiple related charts, a task central to real-world applications such as scientific reporting, financial analysis, and public policy dashboards. Unlike prior benchmarks focusing on isolated, visually uniform charts, InterChart challenges models with diverse question types ranging from entity inference and trend correlation to numerical estimation and abstract multi-step reasoning grounded in 2-3 thematically or structurally related charts. We organize the benchmark into three tiers of increasing difficulty: (1) factual reasoning over individual charts, (2) integrative analysis across synthetically aligned chart sets, and (3) semantic inference over visually complex, real-world chart pairs. Our evaluation of state-of-the-art open- and closed-source VLMs reveals consistent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

interchart/Interchart
dataset· 311 dl
311 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.