Tabular Transfer Learning via Prompting LLMs
Jaehyun Nam, Woomin Song, Seong Hyeon Park, Jihoon Tack, Sukmin Yun,, Jaehyung Kim, Kyu Hwan Oh, Jinwoo Shin

TL;DR
This paper introduces P2T, a novel framework leveraging large language models for transfer learning on tabular data, effectively handling heterogeneous datasets and improving performance on benchmark tasks.
Contribution
It proposes a new method that uses LLMs to perform transfer learning on tabular data with different formats, addressing a less explored area.
Findings
P2T outperforms previous methods on various benchmarks.
The framework effectively utilizes unlabeled and heterogeneous source data.
Prompt-based transfer learning improves tabular task performance.
Abstract
Learning with a limited number of labeled data is a central problem in real-world applications of machine learning, as it is often expensive to obtain annotations. To deal with the scarcity of labeled data, transfer learning is a conventional approach; it suggests to learn a transferable knowledge by training a neural network from multiple other sources. In this paper, we investigate transfer learning of tabular tasks, which has been less studied and successful in the literature, compared to other domains, e.g., vision and language. This is because tables are inherently heterogeneous, i.e., they contain different columns and feature spaces, making transfer learning difficult. On the other hand, recent advances in natural language processing suggest that the label scarcity issue can be mitigated by utilizing in-context learning capability of large language models (LLMs). Inspired by this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Speech Recognition and Synthesis · Topic Modeling
