Enhancing Mathematical Reasoning in LLMs with Background Operators

Jiajun Chen; Yik-Cheung Tam

arXiv:2412.04110·cs.AI·December 6, 2024

Enhancing Mathematical Reasoning in LLMs with Background Operators

Jiajun Chen, Yik-Cheung Tam

PDF

Open Access 1 Datasets

TL;DR

This paper introduces background operators and a Prolog-based approach to improve mathematical reasoning in large language models, achieving high accuracy and expanding solution coverage through self-training and data augmentation.

Contribution

It presents a novel method using background mathematical predicates and Prolog solutions, combined with self-training, to enhance reasoning capabilities in LLMs.

Findings

01

Achieved 84.6% accuracy with self-training on the cross-validated set.

02

Improved solution coverage by incorporating background predicates into prompts.

03

Successfully generated new, fully computable solutions for unseen problems.

Abstract

We propose utilizing background operators for mathematical reasoning in large language models (LLMs). To achieve this, we define a set of fundamental mathematical predicates as the basic building blocks. For each mathematical problem, we develop a Prolog solution that includes problem-specific predicates and intermediate predicates derived from these background operators, ensuring that each solution adheres to the defined operator set. We introduce the MATH-Prolog corpus, which is derived from the counting and probability categories of the MATH corpus. For efficient data augmentation, we apply K-fold cross-validated self-training. This method incrementally generates new Prolog solutions for each fold, incorporating those verified as correct into the training set throughout the model training process. Our experimental results demonstrate that 5-fold crossvalidated self-training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

wilsontam/gsm8k-prolog-test
dataset· 10 dl
10 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematics, Computing, and Information Processing · Intelligent Tutoring Systems and Adaptive Learning · Open Education and E-Learning

MethodsSparse Evolutionary Training