Paper2SysArch: Structure-Constrained System Architecture Generation from Scientific Papers
Ziyi Guo, Zhou Liu, Wentao Zhang

TL;DR
This paper introduces a large-scale benchmark for automated diagram generation from scientific papers and proposes Paper2SysArch, an end-to-end system that effectively converts papers into structured diagrams, advancing research in scientific visualization.
Contribution
It establishes the first comprehensive benchmark with 3,000 papers and ground-truth diagrams, and presents Paper2SysArch, a novel system leveraging multi-agent collaboration for diagram generation.
Findings
Achieved a composite score of 69.0 on complex papers.
Created the first benchmark with 3,000 paper-diagram pairs.
Demonstrated the effectiveness of multi-agent collaboration in diagram generation.
Abstract
The manual creation of system architecture diagrams for scientific papers is a time-consuming and subjective process, while existing generative models lack the necessary structural control and semantic understanding for this task. A primary obstacle hindering research and development in this domain has been the profound lack of a standardized benchmark to quantitatively evaluate the automated generation of diagrams from text. To address this critical gap, we introduce a novel and comprehensive benchmark, the first of its kind, designed to catalyze progress in automated scientific visualization. It consists of 3,000 research papers paired with their corresponding high-quality ground-truth diagrams and is accompanied by a three-tiered evaluation metric assessing semantic accuracy, layout coherence, and visual quality. Furthermore, to establish a strong baseline on this new benchmark, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Machine Learning in Materials Science · Scientific Computing and Data Management
