Can a Large Language Model Learn Matrix Functions In Context?

Paimon Goulart; Evangelos E. Papalexakis

arXiv:2411.15675·cs.LG·November 26, 2024

TL;DR

This paper investigates whether Large Language Models can learn matrix functions through in-context learning, with emphasis on functions of the Singular Value Decomposition, and finds that they show promise on complex numerical tasks.

Contribution

The study shows that LLMs perform comparably to traditional models (SGD-based Linear Regression and Neural Networks) on simpler matrix functions and outperform them on more complex ones, particularly top-k singular values, while maintaining high accuracy as matrix size increases.

Findings

01 LLMs perform well on complex matrix functions involving SVD.

02 LLMs outperform classical models on top-k singular value tasks.

03 LLMs maintain high accuracy as matrix size increases.

Abstract

Large Language Models (LLMs) have demonstrated the ability to solve complex tasks through In-Context Learning (ICL), where models learn from a few input-output pairs without explicit fine-tuning. In this paper, we explore the capacity of LLMs to solve non-linear numerical computations, with specific emphasis on functions of the Singular Value Decomposition. Our experiments show that while LLMs perform comparably to traditional models such as Stochastic Gradient Descent (SGD) based Linear Regression and Neural Networks (NN) for simpler tasks, they outperform these models on more complex tasks, particularly in the case of top-k Singular Values. Furthermore, LLMs demonstrate strong scalability, maintaining high accuracy even as the matrix size increases. Additionally, we found that LLMs can achieve high accuracy with minimal prior examples, converging quickly and avoiding the overfitting…
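To make the in-context setup concrete, the sketch below builds a few-shot prompt in which each example pairs a random matrix with its top-k singular values, followed by a query matrix whose output is left blank for the model to complete. This is a minimal illustration, not the authors' pipeline: the prompt layout (`Input:` / `Output:`), the uniform random entries, the two-decimal rounding, and the helper names `top_k_singular_values`, `format_matrix`, and `build_icl_prompt` are all assumptions made here for illustration.

```python
import numpy as np


def top_k_singular_values(matrix, k):
    # With compute_uv=False, np.linalg.svd returns only the singular
    # values, sorted in descending order, so the first k are the top-k.
    return np.linalg.svd(matrix, compute_uv=False)[:k]


def format_matrix(matrix, decimals=2):
    # Serialize a 2-D array as nested bracketed lists of rounded numbers.
    rows = ("[" + ", ".join(f"{v:.{decimals}f}" for v in row) + "]" for row in matrix)
    return "[" + ", ".join(rows) + "]"


def format_vector(vector, decimals=2):
    # Serialize a 1-D array as a bracketed list of rounded numbers.
    return "[" + ", ".join(f"{v:.{decimals}f}" for v in vector) + "]"


def build_icl_prompt(n_examples, size, k, seed=0):
    # Hypothetical prompt layout: n_examples solved pairs, then one
    # unsolved query. Returns the prompt text and the true answer for
    # the query so a model's completion can be scored against it.
    rng = np.random.default_rng(seed)
    blocks = []
    for _ in range(n_examples):
        m = rng.uniform(0.0, 1.0, size=(size, size))
        s = top_k_singular_values(m, k)
        blocks.append(f"Input: {format_matrix(m)}\nOutput: {format_vector(s)}")
    query = rng.uniform(0.0, 1.0, size=(size, size))
    blocks.append(f"Input: {format_matrix(query)}\nOutput:")
    return "\n\n".join(blocks), top_k_singular_values(query, k)


if __name__ == "__main__":
    prompt, target = build_icl_prompt(n_examples=3, size=2, k=1)
    print(prompt)
    print("Expected top-k singular values for the query:", target)
```

The returned target makes it straightforward to compare a model's completion against the exact answer, for example with a mean squared error over the k values.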

Code & Models

Repositories

Pie115/Learning-Matrix-Functions-In-Context (Official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling

Methods: Linear Regression · Focus