Solving the multiplication problem of a large language model system   using a graph-based method

Turker Tuncer; Sengul Dogan; Mehmet Baygin; Prabal Datta; Barua; Abdul Hafeez-Baig; Ru-San Tan; Subrata Chakraborty; U.; Rajendra Acharya

arXiv:2310.13016·cs.OH·October 23, 2023·1 cites

Solving the multiplication problem of a large language model system using a graph-based method

Turker Tuncer, Sengul Dogan, Mehmet Baygin, Prabal Datta, Barua, Abdul Hafeez-Baig, Ru-San Tan, Subrata Chakraborty, U., Rajendra Acharya

PDF

Open Access

TL;DR

This paper introduces a graph-based multiplication algorithm that significantly improves the accuracy of large number multiplication in GPT-based models, achieving perfect results on extensive tests by mimicking human-like numerical operations.

Contribution

The paper presents a novel graph-based multiplication method that enhances large number multiplication accuracy in language models, bridging the gap between human intuition and AI algorithms.

Findings

01

Achieved 100% accuracy on 1,000,000 large number multiplications.

02

Effectively solved multiplication challenges in GPT-based and other large language models.

03

Demonstrated the importance of human-inspired insights in AI algorithm design.

Abstract

The generative pre-trained transformer (GPT)-based chatbot software ChatGPT possesses excellent natural language processing capabilities but is inadequate for solving arithmetic problems, especially multiplication. Its GPT structure uses a computational graph for multiplication, which has limited accuracy beyond simple multiplication operations. We developed a graph-based multiplication algorithm that emulated human-like numerical operations by incorporating a 10k operator, where k represents the maximum power to base 10 of the larger of two input numbers. Our proposed algorithm attained 100% accuracy for 1,000,000 large number multiplication tasks, effectively solving the multiplication challenge of GPT-based and other large language models. Our work highlights the importance of blending simple human insights into the design of artificial intelligence algorithms. Keywords: Graph-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFinTech, Crowdfunding, Digital Finance

MethodsMulti-Head Attention · Attention Is All You Need · Cosine Annealing · Discriminative Fine-Tuning · Layer Normalization · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Cosine Annealing · Linear Layer · Attention Dropout · Softmax