Geoparsing: Diagram Parsing for Plane and Solid Geometry with a Unified Formal Language

Peijie Wang; Ming-Liang Zhang; Jun Cao; Chao Deng; Dekang Ran; Hongda Sun; Pi Bu; Xuan Zhang; Yingyao Wang; Jun Song; Bo Zheng; Fei Yin; Cheng-Lin Liu

arXiv:2604.11600·cs.CV·April 17, 2026

Geoparsing: Diagram Parsing for Plane and Solid Geometry with a Unified Formal Language

Peijie Wang, Ming-Liang Zhang, Jun Cao, Chao Deng, Dekang Ran, Hongda Sun, Pi Bu, Xuan Zhang, Yingyao Wang, Jun Song, Bo Zheng, Fei Yin, Cheng-Lin Liu

PDF

1 Repo 1 Models 1 Datasets

TL;DR

This paper introduces a unified formal language for plane and solid geometry, along with a large dataset and training paradigm, to improve geometric reasoning in multimodal large language models.

Contribution

It presents a new formal language, a large-scale dataset, and a training method that enhances geometric understanding and reasoning in multimodal models.

Findings

01

Achieved state-of-the-art parsing performance.

02

Parsed formal descriptions significantly improve geometry reasoning tasks.

03

Constructed GDP-29K dataset with 29,000 geometry samples.

Abstract

Multimodal Large Language Models (MLLMs) have achieved remarkable progress but continue to struggle with geometric reasoning, primarily due to the perception bottleneck regarding fine-grained visual elements. While formal languages have aided plane geometry understanding, solid geometry which requires spatial understanding remains largely unexplored. In this paper, we address this challenge by designing a unified formal language that integrates plane and solid geometry, comprehensively covering geometric structures and semantic relations. We construct GDP-29K, a large-scale dataset comprising 20k plane and 9k solid geometry samples collected from diverse real-world sources, each paired with its ground-truth formal description. To ensure syntactic correctness and geometric consistency, we propose a training paradigm that combines Supervised Fine-Tuning with Reinforcement Learning via…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

null
github

Models

🤗
PeijieWang/GDP-4B
model· 47 dl
47 dl

Datasets

PeijieWang/GDP29K
dataset· 200 dl
200 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.