Tabular Transfer Learning via Prompting LLMs

Jaehyun Nam; Woomin Song; Seong Hyeon Park; Jihoon Tack; Sukmin Yun,; Jaehyung Kim; Kyu Hwan Oh; Jinwoo Shin

arXiv:2408.11063·cs.CL·August 22, 2024

Tabular Transfer Learning via Prompting LLMs

Jaehyun Nam, Woomin Song, Seong Hyeon Park, Jihoon Tack, Sukmin Yun,, Jaehyung Kim, Kyu Hwan Oh, Jinwoo Shin

PDF

Open Access

TL;DR

This paper introduces P2T, a novel framework leveraging large language models for transfer learning on tabular data, effectively handling heterogeneous datasets and improving performance on benchmark tasks.

Contribution

It proposes a new method that uses LLMs to perform transfer learning on tabular data with different formats, addressing a less explored area.

Findings

01

P2T outperforms previous methods on various benchmarks.

02

The framework effectively utilizes unlabeled and heterogeneous source data.

03

Prompt-based transfer learning improves tabular task performance.

Abstract

Learning with a limited number of labeled data is a central problem in real-world applications of machine learning, as it is often expensive to obtain annotations. To deal with the scarcity of labeled data, transfer learning is a conventional approach; it suggests to learn a transferable knowledge by training a neural network from multiple other sources. In this paper, we investigate transfer learning of tabular tasks, which has been less studied and successful in the literature, compared to other domains, e.g., vision and language. This is because tables are inherently heterogeneous, i.e., they contain different columns and feature spaces, making transfer learning difficult. On the other hand, recent advances in natural language processing suggest that the label scarcity issue can be mitigated by utilizing in-context learning capability of large language models (LLMs). Inspired by this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech Recognition and Synthesis · Topic Modeling