Strategies for Improving NL-to-FOL Translation with LLMs: Data   Generation, Incremental Fine-Tuning, and Verification

Ramya Keerthy Thatikonda; Jiuzhou Han; Wray Buntine; Ehsan Shareghi

arXiv:2409.16461·cs.CL·September 26, 2024

Strategies for Improving NL-to-FOL Translation with LLMs: Data Generation, Incremental Fine-Tuning, and Verification

Ramya Keerthy Thatikonda, Jiuzhou Han, Wray Buntine, Ehsan Shareghi

PDF

Open Access 1 Repo

TL;DR

This paper enhances natural language to first-order logic translation in LLMs by creating high-quality datasets, introducing incremental fine-tuning, and developing verification methods, leading to state-of-the-art results in logical reasoning tasks.

Contribution

It introduces ProofFOL, a high-quality FOL dataset, and an incremental framework with data augmentation and verification to improve translation accuracy in smaller LLMs.

Findings

01

Significant performance gains on ProofWriter and ProntoQA datasets.

02

ProofFOL enables smaller models to outperform larger ones.

03

Verification reduces translation errors effectively.

Abstract

Logical reasoning is a fundamental task in natural language processing that presents significant challenges to Large Language Models (LLMs). The inherent characteristics of logical reasoning makes it well-suited for symbolic representations such as first-order logic (FOL). Research in symbolic logical reasoning explored FOL generation using state-of-the-art LLMs (i.e., GPT-4) to produce FOL translations of natural language (NL) statements, but errors in translation are usually not the focus. We address this by categorizing the translation errors in FOL statements generated by LLMs. To make progress towards improving the quality of FOL translations for smaller language models such as LLaMA-2 13B and Mistral 7B, we create ProofFOL, a high-quality FOL-annotated subset of ProofWriter dataset using GPT-4o. The models fine-tuned on this silver standard data achieve a significant gain in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

RamyaKeerthy/Translation-NL2FOL
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques