Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey
Xi Fang, Weijie Xu, Fiona Anting Tan, Jiani Zhang, Ziqing Hu, Yanjun, Qi, Scott Nickleach, Diego Socolinsky, Srinivasan Sengamedu, Christos, Faloutsos

TL;DR
This survey reviews recent advances in applying large language models to tabular data tasks, including prediction, synthesis, and understanding, highlighting techniques, datasets, metrics, and future research directions.
Contribution
It provides a comprehensive taxonomy and comparison of methods, datasets, and challenges in LLM-based tabular data modeling, filling a gap in current literature.
Findings
Identifies key datasets and metrics used in the field.
Highlights strengths and limitations of current approaches.
Suggests unexplored areas and future research directions.
Abstract
Recent breakthroughs in large language modeling have facilitated rigorous exploration of their application in diverse tasks related to tabular data modeling, such as prediction, tabular data synthesis, question answering, and table understanding. Each task presents unique challenges and opportunities. However, there is currently a lack of comprehensive review that summarizes and compares the key techniques, metrics, datasets, models, and optimization approaches in this research domain. This survey aims to address this gap by consolidating recent progress in these areas, offering a thorough survey and taxonomy of the datasets, metrics, and methodologies utilized. It identifies strengths, limitations, unexplored territories, and gaps in the existing literature, while providing some insights for future research directions in this vital and rapidly evolving field. It also provides relevant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
