Scientific Graphics Program Synthesis via Dual Self-Consistency Reinforcement Learning

Juekai Lin; Yun Zhu; Honglin Lin; Sijing Li; Tianwei Lin; Zheng Liu; Xiaoyang Wang; Wenqiao Zhang; Lijun Wu

arXiv:2604.06079·cs.CV·April 8, 2026

Scientific Graphics Program Synthesis via Dual Self-Consistency Reinforcement Learning

Juekai Lin, Yun Zhu, Honglin Lin, Sijing Li, Tianwei Lin, Zheng Liu, Xiaoyang Wang, Wenqiao Zhang, Lijun Wu

PDF

TL;DR

This paper introduces a new dataset, benchmark, and a dual self-consistency reinforcement learning method for scientific graphics program synthesis, significantly improving the accuracy and fidelity of generated TikZ code.

Contribution

The paper presents a high-quality dataset, a comprehensive benchmark, and a novel reinforcement learning paradigm for improved scientific graphics synthesis.

Findings

01

SciTikZer-8B outperforms proprietary and large models in graphics synthesis.

02

The dataset covers 11 scientific disciplines with high-quality, executable data.

03

The reinforcement learning approach enhances structural and visual fidelity.

Abstract

Graphics Program Synthesis is pivotal for interpreting and editing visual data, effectively facilitating the reverse-engineering of static visuals into editable TikZ code. While TikZ is the de facto standard for scientific schematics due to its programmatic flexibility, its requirement for rigorous spatial precision presents a significant challenge for Multimodal Large Language Models. Progress is currently stifled by two primary gaps: (1) Data Quality Gap: existing image-TikZ corpora often lack strict executability and reliable visual alignment; (2) Evaluation Gap: a lack of benchmarks for both structural and visual fidelity. To address these, we present a closed-loop framework featuring: SciTikZ-230K, a large-scale, high-quality dataset from our Execution-Centric Data Engine covering 11 diverse scientific disciplines; SciTikZ-Bench, a multifaceted benchmark spanning from basic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.