Large Language Models as Universal Predictors? An Empirical Study on Small Tabular Datasets

Nikolaos Pavlidis; Vasilis Perifanis; Symeon Symeonidis; Pavlos S. Efraimidis

arXiv:2508.17391·cs.AI·August 26, 2025

Large Language Models as Universal Predictors? An Empirical Study on Small Tabular Datasets

Nikolaos Pavlidis, Vasilis Perifanis, Symeon Symeonidis, Pavlos S. Efraimidis

PDF

TL;DR

This study empirically evaluates large language models' ability to perform classification, regression, and clustering on small tabular datasets, highlighting their strengths in classification and limitations in regression and clustering.

Contribution

It demonstrates that LLMs can serve as practical zero-training predictors for structured data, especially in classification tasks, and analyzes factors affecting their performance.

Findings

01

LLMs perform well in classification with limited data

02

Regression performance of LLMs is poor compared to traditional ML models

03

Clustering results are limited due to lack of genuine in-context learning

Abstract

Large Language Models (LLMs), originally developed for natural language processing (NLP), have demonstrated the potential to generalize across modalities and domains. With their in-context learning (ICL) capabilities, LLMs can perform predictive tasks over structured inputs without explicit fine-tuning on downstream tasks. In this work, we investigate the empirical function approximation capability of LLMs on small-scale structured datasets for classification, regression and clustering tasks. We evaluate the performance of state-of-the-art LLMs (GPT-5, GPT-4o, GPT-o3, Gemini-2.5-Flash, DeepSeek-R1) under few-shot prompting and compare them against established machine learning (ML) baselines, including linear models, ensemble methods and tabular foundation models (TFMs). Our results show that LLMs achieve strong performance in classification tasks under limited data availability,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.