TL;DR
This paper introduces Infogen, a novel AI framework that generates complex, multi-chart statistical infographics from documents, addressing a gap in visual data summarization and understanding.
Contribution
The paper presents Infogen, a two-stage LLM-based system and Infodat, a benchmark dataset for generating detailed infographic metadata from text documents.
Findings
Infogen outperforms existing models in infographic generation accuracy.
Infodat enables effective training and evaluation of text-to-infographic systems.
The framework produces contextually accurate and visually aligned complex infographics.
Abstract
Statistical infographics are powerful tools that simplify complex data into visually engaging and easy-to-understand formats. Despite advancements in AI, particularly with LLMs, existing efforts have been limited to generating simple charts, with no prior work addressing the creation of complex infographics from text-heavy documents that demand a deep understanding of the content. We address this gap by introducing the task of generating statistical infographics composed of multiple sub-charts (e.g., line, bar, pie) that are contextually accurate, insightful, and visually aligned. To achieve this, we define infographic metadata that includes its title and textual insights, along with sub-chart-specific details such as their corresponding data and alignment. We also present Infodat, the first benchmark dataset for text-to-infographic metadata generation, where each sample links a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
