Towards Better Understanding Table Instruction Tuning: Decoupling the   Effects from Data versus Models

Naihao Deng; Sheng Zhang; Henghui Zhu; Shuaichen Chang; Jiani Zhang,; Alexander Hanbo Li; Chung-Wei Hang; Hideo Kobayashi; Yiqun Hu; Patrick Ng

arXiv:2501.14717·cs.CL·January 27, 2025

Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models

Naihao Deng, Sheng Zhang, Henghui Zhu, Shuaichen Chang, Jiani Zhang,, Alexander Hanbo Li, Chung-Wei Hang, Hideo Kobayashi, Yiqun Hu, Patrick Ng

PDF

Open Access

TL;DR

This paper investigates how training data and model architecture individually influence the performance of table instruction tuning in large language models, providing insights into their separate effects and trade-offs.

Contribution

It systematically decouples the effects of training data and model architecture on table instruction tuning performance, offering new insights and state-of-the-art results.

Findings

01

Achieved new state-of-the-art on Hitab dataset.

02

Decoupled contributions of data and models through systematic evaluation.

03

Revealed trade-offs between specialization and generalization in table instruction tuning.

Abstract

Recent advances in natural language processing have leveraged instruction tuning to enhance Large Language Models (LLMs) for table-related tasks. However, previous works train different base models with different training data, lacking an apples-to-apples comparison across the result table LLMs. To address this, we fine-tune base models from the Mistral, OLMo, and Phi families on existing public training datasets. Our replication achieves performance on par with or surpassing existing table LLMs, establishing new state-of-the-art performance on Hitab, a table question-answering dataset. More importantly, through systematic out-of-domain evaluation, we decouple the contributions of training data and the base model, providing insight into their individual impacts. In addition, we assess the effects of table-specific instruction tuning on general-purpose benchmarks, revealing trade-offs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline Learning and Analytics · Intelligent Tutoring Systems and Adaptive Learning · Statistics Education and Methodologies

MethodsBalanced Selection