HeGTa: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding
Rihui Jin, Yu Li, Guilin Qi, Nan Hu, Yuan-Fang Li, Jiaoyan Chen,, Jianan Wang, Yongrui Chen, Dehai Min, Sheng Bi

TL;DR
HGT is a novel framework combining heterogeneous graph-enhanced LLMs with multi-task pre-training to improve few-shot complex table understanding, addressing data scarcity and structural complexity.
Contribution
It introduces a heterogeneous graph-enhanced LLM framework with multi-granularity self-supervised pre-training for few-shot complex table understanding.
Findings
Outperforms state-of-the-art on several benchmarks.
Effective in handling complex table structures.
Enhances few-shot learning capabilities.
Abstract
Table understanding (TU) has achieved promising advancements, but it faces the challenges of the scarcity of manually labeled tables and the presence of complex table structures.To address these challenges, we propose HGT, a framework with a heterogeneous graph (HG)-enhanced large language model (LLM) to tackle few-shot TU tasks.It leverages the LLM by aligning the table semantics with the LLM's parametric knowledge through soft prompts and instruction turning and deals with complex tables by a multi-task pre-training scheme involving three novel multi-granularity self-supervised HG pre-training objectives.We empirically demonstrate the effectiveness of HGT, showing that it outperforms the SOTA for few-shot complex TU on several benchmarks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Data Quality and Management · Natural Language Processing Techniques
