End-to-End Chart Summarization via Visual Chain-of-Thought in Vision-Language Models
Raymond Choi, Frank Burns, Chase Lawrence

TL;DR
This paper presents a novel end-to-end visual chain-of-thought approach for chart summarization using large vision-language models, significantly improving automatic and human evaluation metrics over previous methods.
Contribution
It introduces a visual Chain-of-Thought mechanism with instruction fine-tuning for end-to-end chart summarization, eliminating explicit parsing modules and enhancing reasoning capabilities.
Findings
Outperforms state-of-the-art baselines on Chart-Sum-QA dataset
Achieves higher scores on BLEU, BLEURT, CIDEr, and CS metrics
Demonstrates superior reasoning and matching in human evaluations
Abstract
Automated chart summarization is crucial for enhancing data accessibility and enabling efficient information extraction from visual data. While recent advances in visual-language models (VLMs) have demonstrated promise, existing methods often suffer from limitations in matching the generated summary to the chart data and in reasoning about complex chart patterns. This paper introduces End-to-End Visual Chain-of-Thought (V-CoT) for chart summarization, a novel approach optimized for Large Vision-Language Models (LVLMs). Our method directly trains an LVLM to process chart images and generate textual summaries in an end-to-end fashion, eliminating the need for explicit chart parsing modules. We incorporate a visual Chain-of-Thought mechanism through instruction fine-tuning, implicitly guiding the LVLM to perform visual reasoning steps during summary generation. Evaluated on the large-scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Data Visualization and Analytics · Species Distribution and Climate Change
