DiagramEval: Evaluating LLM-Generated Diagrams via Graphs

Chumeng Liang; Jiaxuan You

arXiv:2510.25761·cs.CL·November 3, 2025

DiagramEval: Evaluating LLM-Generated Diagrams via Graphs

Chumeng Liang, Jiaxuan You

PDF

TL;DR

DiagramEval introduces a graph-based evaluation metric for assessing the quality of diagrams generated by large language models, focusing on structure and connections to improve interpretability and accuracy.

Contribution

This paper presents a novel graph-based evaluation metric for LLM-generated diagrams, enabling more accurate and explainable assessment of diagram quality.

Findings

01

Effective evaluation of LLM-generated diagrams using graph-based metrics

02

Quantitative validation on recent research literature diagrams

03

Enhanced explainability provides insights into diagram characteristics

Abstract

Diagrams play a central role in research papers for conveying ideas, yet they are often notoriously complex and labor-intensive to create. Although diagrams are presented as images, standard image generative models struggle to produce clear diagrams with well-defined structure. We argue that a promising direction is to generate demonstration diagrams directly in textual form as SVGs, which can leverage recent advances in large language models (LLMs). However, due to the complexity of components and the multimodal nature of diagrams, sufficiently discriminative and explainable metrics for evaluating the quality of LLM-generated diagrams remain lacking. In this paper, we propose DiagramEval, a novel evaluation metric designed to assess demonstration diagrams generated by LLMs. Specifically, DiagramEval conceptualizes diagrams as graphs, treating text elements as nodes and their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.