KELPS: A Framework for Verified Multi-Language Autoformalization via Semantic-Syntactic Alignment

Jiyao Zhang; Chengli Zhong; Hui Xu; Qige Li; Yi Zhou

arXiv:2507.08665·cs.CL·July 14, 2025

KELPS: A Framework for Verified Multi-Language Autoformalization via Semantic-Syntactic Alignment

Jiyao Zhang, Chengli Zhong, Hui Xu, Qige Li, Yi Zhou

PDF

TL;DR

KELPS is a neuro-symbolic framework that translates informal mathematical language into multiple formal languages using a novel intermediate language, significantly improving formalization accuracy and creating a large parallel corpus.

Contribution

The paper introduces KELPS, a new neuro-symbolic framework that leverages a novel language and rule-based translation to enhance multi-language formalization of informal mathematics.

Findings

01

Achieved 88.9% syntactic accuracy on MiniF2F.

02

Generated over 60,000 problems in a parallel corpus.

03

Outperformed state-of-the-art models like Deepseek-V3 and Herald.

Abstract

Modern large language models (LLMs) show promising progress in formalizing informal mathematics into machine-verifiable theorems. However, these methods still face bottlenecks due to the limited quantity and quality of multilingual parallel corpora. In this paper, we propose a novel neuro-symbolic framework KELPS (Knowledge-Equation based Logical Processing System) to address these problems. KELPS is an iterative framework for translating, synthesizing, and filtering informal data into multiple formal languages (Lean, Coq, and Isabelle). First, we translate natural language into Knowledge Equations (KEs), a novel language that we designed, theoretically grounded in assertional logic. Next, we convert them to target languages through rigorously defined rules that preserve both syntactic structure and semantic meaning. This process yielded a parallel corpus of over 60,000 problems. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.