Improving the Diproche CNL through Autoformalization via Large Language   Models

Merlin Carl (Europa-Universit\"at Flensburg)

arXiv:2303.17513·cs.CL·April 11, 2024·ThEdu@CADE·1 cites

Improving the Diproche CNL through Autoformalization via Large Language Models

Merlin Carl (Europa-Universit\"at Flensburg)

PDF

Open Access

TL;DR

This paper investigates enhancing the Diproche proof checker by leveraging large language models for autoformalization, aiming to improve proof verification in educational settings involving controlled natural language.

Contribution

It introduces a novel approach of using large language models to autoformalize controlled natural language proofs in Diproche, improving the system's flexibility and potential accuracy.

Findings

01

Encouraging initial results in autoformalization accuracy

02

Potential for improved proof checking in didactical contexts

03

Demonstrates feasibility of LLM-based formalization for proof systems

Abstract

The Diproche system is an automated proof checker for texts written in a controlled fragment of German, designed for didactical applications in classes introducing students to proofs for the first time. The first version of the system used a controlled natural language for which a Prolog formalization routine was written. In this paper, we explore the possibility of prompting large language models for autoformalization in the context of Diproche, with encouraging first results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Mathematics, Computing, and Information Processing · Topic Modeling