DiagramEval: Evaluating LLM-Generated Diagrams via Graphs
Chumeng Liang, Jiaxuan You

TL;DR
DiagramEval introduces a graph-based evaluation metric for assessing the quality of diagrams generated by large language models, focusing on structure and connections to improve interpretability and accuracy.
Contribution
This paper presents a novel graph-based evaluation metric for LLM-generated diagrams, enabling more accurate and explainable assessment of diagram quality.
Findings
Effective evaluation of LLM-generated diagrams using graph-based metrics
Quantitative validation on recent research literature diagrams
Enhanced explainability provides insights into diagram characteristics
Abstract
Diagrams play a central role in research papers for conveying ideas, yet they are often notoriously complex and labor-intensive to create. Although diagrams are presented as images, standard image generative models struggle to produce clear diagrams with well-defined structure. We argue that a promising direction is to generate demonstration diagrams directly in textual form as SVGs, which can leverage recent advances in large language models (LLMs). However, due to the complexity of components and the multimodal nature of diagrams, sufficiently discriminative and explainable metrics for evaluating the quality of LLM-generated diagrams remain lacking. In this paper, we propose DiagramEval, a novel evaluation metric designed to assess demonstration diagrams generated by LLMs. Specifically, DiagramEval conceptualizes diagrams as graphs, treating text elements as nodes and their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
