From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models
Kung-Hsiang Huang, Hou Pong Chan, Yi R. Fung, Haoyi Qiu, Mingyang, Zhou, Shafiq Joty, Shih-Fu Chang, Heng Ji

TL;DR
This survey reviews recent advancements in automatic chart understanding driven by large foundation models, highlighting challenges, modeling strategies, and future research directions in the field.
Contribution
It provides a comprehensive overview of recent developments, challenges, and future directions in chart understanding using large foundation models, including evaluation metrics and modeling approaches.
Findings
State-of-the-art performance in chart understanding tasks
Identification of key challenges like domain-specific charts and evaluation metrics
Discussion of tool augmentation techniques for improved performance
Abstract
Data visualization in the form of charts plays a pivotal role in data analysis, offering critical insights and aiding in informed decision-making. Automatic chart understanding has witnessed significant advancements with the rise of large foundation models in recent years. Foundation models, such as large language models, have revolutionized various natural language processing tasks and are increasingly being applied to chart understanding tasks. This survey paper provides a comprehensive overview of the recent developments, challenges, and future directions in chart understanding within the context of these foundation models. We review fundamental building blocks crucial for studying chart understanding tasks. Additionally, we explore various tasks and their evaluation metrics and sources of both charts and textual inputs. Various modeling strategies are then examined, encompassing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReservoir Engineering and Simulation Methods
