A Closer Look into LLMs for Table Understanding
Jia Wang, Chuanyu Qin, Mingyu Zheng, Qingyi Si, Peize Li, Zheng Lin

TL;DR
This paper empirically investigates how large language models understand tabular data, revealing attention patterns, layer depth requirements, expert activations, and the effects of input design on table understanding.
Contribution
It provides a detailed analysis of LLMs' internal mechanisms for table understanding, including attention dynamics and expert activation patterns, which was previously unclear.
Findings
LLMs follow a three-phase attention pattern across layers.
Deeper layers are needed for stable predictions in tabular tasks.
MoE models activate table-specific experts in middle layers.
Abstract
Despite the success of Large Language Models (LLMs) in table understanding, their internal mechanisms remain unclear. In this paper, we conduct an empirical study on 16 LLMs, covering general LLMs, specialist tabular LLMs, and Mixture-of-Experts (MoE) models, to explore how LLMs understand tabular data and perform downstream tasks. Our analysis focus on 4 dimensions including the attention dynamics, the effective layer depth, the expert activation, and the impacts of input designs. Key findings include: (1) LLMs follow a three-phase attention pattern -- early layers scan the table broadly, middle layers localize relevant cells, and late layers amplify their contributions; (2) tabular tasks require deeper layers than math reasoning to reach stable predictions; (3) MoE models activate table-specific experts in middle layers, with early and late layers sharing general-purpose experts; (4)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Handwritten Text Recognition Techniques · Visual and Cognitive Learning Processes
