OccamLLM: Fast and Exact Language Model Arithmetic in a Single Step

Owen Dugan; Donato Manuel Jimenez Beneto; Charlotte Loh; Zhuo Chen,; Rumen Dangovski; Marin Solja\v{c}i\'c

arXiv:2406.06576·cs.CL·September 4, 2024

OccamLLM: Fast and Exact Language Model Arithmetic in a Single Step

Owen Dugan, Donato Manuel Jimenez Beneto, Charlotte Loh, Zhuo Chen,, Rumen Dangovski, Marin Solja\v{c}i\'c

PDF

Open Access 1 Video

TL;DR

OccamLLM introduces a novel framework that enables exact arithmetic in a single step using a symbolic approach, significantly improving speed, security, and accuracy in LLM-based arithmetic operations and benchmarks.

Contribution

The paper presents a new method combining LLM hidden states with symbolic architecture to perform exact arithmetic instantly, surpassing existing models in accuracy and efficiency.

Findings

01

Achieves 100% accuracy on basic arithmetic operations

02

Outperforms GPT-4 on mathematical benchmarks

03

Provides faster and more secure arithmetic computations

Abstract

Despite significant advancements in text generation and reasoning, Large Language Models (LLMs) still face challenges in accurately performing complex arithmetic operations. Language model systems often enable LLMs to generate code for arithmetic operations to achieve accurate calculations. However, this approach compromises speed and security, and fine-tuning risks the language model losing prior capabilities. We propose a framework that enables exact arithmetic in a single autoregressive step, providing faster, more secure, and more interpretable LLM systems with arithmetic capabilities. We use the hidden states of a LLM to control a symbolic architecture that performs arithmetic. Our implementation using Llama 3 with OccamNet as a symbolic model (OccamLlama) achieves 100\% accuracy on single arithmetic operations ( $+, -, \times, \div, sin, cos, lo g, exp,$ ), outperforming…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

OccamLLM: Fast and Exact Language Model Arithmetic in a Single Step· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Byte Pair Encoding · Adam · Attention Dropout · Weight Decay · Linear Warmup With Cosine Annealing · Linear Layer · Multi-Head Attention