Are Large Vision Language Models up to the Challenge of Chart Comprehension and Reasoning? An Extensive Investigation into the Capabilities and Limitations of LVLMs
Mohammed Saidul Islam, Raian Rahman, Ahmed Masry, Md Tahmid Rahman, Laskar, Mir Tafseer Nayeem, Enamul Hoque

TL;DR
This paper thoroughly evaluates large vision language models' capabilities in understanding and reasoning about data visualizations like charts, revealing their strengths in generating insights but also their limitations such as hallucinations and data bias.
Contribution
It provides the first comprehensive assessment of LVLMs like GPT-4V and Gemini on chart reasoning tasks, highlighting their performance and challenges in data visualization understanding.
Findings
LVLMs generate fluent, high-level data insights.
LVLMs face issues with hallucinations and factual errors.
Performance varies across different chart types.
Abstract
Natural language is a powerful complementary modality of communication for data visualizations, such as bar and line charts. To facilitate chart-based reasoning using natural language, various downstream tasks have been introduced recently such as chart question answering, chart summarization, and fact-checking with charts. These tasks pose a unique challenge, demanding both vision-language reasoning and a nuanced understanding of chart data tables, visual encodings, and natural language prompts. Despite the recent success of Large Language Models (LLMs) across diverse NLP tasks, their abilities and limitations in the realm of data visualization remain under-explored, possibly due to their lack of multi-modal capabilities. To bridge the gap, this paper presents the first comprehensive evaluation of the recently developed large vision language models (LVLMs) for chart understanding and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Speech and dialogue systems · Topic Modeling
