Transfer Learning of Tabular Data by Finetuning Large Language Models

Shourav B. Rabbani; Ibna Kowsar; Manar D. Samad

arXiv:2501.06863·cs.LG·January 14, 2025

Transfer Learning of Tabular Data by Finetuning Large Language Models

Shourav B. Rabbani, Ibna Kowsar, Manar D. Samad

PDF

TL;DR

This paper explores finetuning large language models for tabular data classification, demonstrating superior performance and efficiency on small-feature datasets through transfer learning.

Contribution

It introduces an end-to-end finetuning method for LLMs tailored to tabular data, enabling effective transfer learning without existing large pre-trained models.

Findings

01

Outperforms state-of-the-art methods on small-feature datasets

02

Uses less computational cost than other deep learning approaches

03

Achieves competitive or superior classification accuracy

Abstract

Despite the artificial intelligence (AI) revolution, deep learning has yet to achieve much success with tabular data due to heterogeneous feature space and limited sample sizes without viable transfer learning. The new era of generative AI, powered by large language models (LLM), brings unprecedented learning opportunities to diverse data and domains. This paper investigates the effectiveness of an LLM application programming interface (API) and transfer learning of LLM in tabular data classification. LLM APIs respond to input text prompts with tokenized data and instructions, whereas transfer learning finetunes an LLM for a target classification task. This paper proposes an end-to-end finetuning of LLM to demonstrate cross-data transfer learning on ten benchmark data sets when large pre-trained tabular data models do not exist to facilitate transfer learning. The proposed LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.