TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios
Xiaokang Zhang, Sijia Luo, Bohan Zhang, Zeyao Ma, Jing Zhang, Yang Li,, Guanlin Li, Zijun Yao, Kangli Xu, Jinchang Zhou, Daniel Zhang-Li, Jifan Yu,, Shu Zhao, Juanzi Li, Jie Tang

TL;DR
TableLLM is a specialized 8-billion-parameter LLM designed for effective tabular data manipulation in office scenarios, utilizing novel training strategies and benchmarks to outperform existing models.
Contribution
The paper introduces TableLLM, a purpose-built LLM for tabular data tasks, with a new training method and comprehensive evaluation benchmarks.
Findings
TableLLM outperforms existing LLMs in tabular data tasks.
The proposed training strategies improve reasoning capabilities.
Public release of model, code, and benchmarks.
Abstract
We introduce TableLLM, a robust large language model (LLM) with 8 billion parameters, purpose-built for proficiently handling tabular data manipulation tasks, whether they are embedded within documents or spreadsheets, catering to real-world office scenarios. We propose a distant supervision method for training, which comprises a reasoning process extension strategy, aiding in training LLMs to understand reasoning patterns more effectively as well as a cross-way validation strategy, ensuring the quality of the automatically generated data. To evaluate the performance of TableLLM, we have crafted benchmarks tailored to address both document and spreadsheet formats as well as constructed a well-organized evaluation pipeline capable of handling both scenarios. Thorough evaluations underscore the advantages of TableLLM when compared to various existing general-purpose and tabular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗RUCKBReasoning/TableLLM-13bmodel· 386 dl· ♡ 32386 dl♡ 32
- 🤗RUCKBReasoning/TableLLM-7bmodel· 24 dl· ♡ 1624 dl♡ 16
- 🤗blockblockblock/TableLLM-13b-bpw2.5model· 14 dl14 dl
- 🤗blockblockblock/TableLLM-13b-bpw3model· 5 dl5 dl
- 🤗blockblockblock/TableLLM-13b-bpw3.5model· 3 dl3 dl
- 🤗blockblockblock/TableLLM-13b-bpw3.7model· 4 dl4 dl
- 🤗blockblockblock/TableLLM-13b-bpw4model· 5 dl5 dl
- 🤗blockblockblock/TableLLM-13b-bpw4.2model· 4 dl4 dl
- 🤗blockblockblock/TableLLM-13b-bpw4.4model· 1 dl1 dl
- 🤗blockblockblock/TableLLM-13b-bpw4.6model· 4 dl4 dl
Videos
Taxonomy
TopicsAdvanced Database Systems and Queries · Data Quality and Management · Business Process Modeling and Analysis
