LLaSA: Large Language and Structured Data Assistant
Yao Xu, Shizhu He, Jiabei Chen, Zeng Xiangrong, Bingning Wang, Guang, Liu, Jun Zhao, Kang Liu

TL;DR
LLaSA introduces a unified hypergraph-based framework to enhance large language models' ability to process diverse structured data, improving performance on knowledge grounding tasks through pretraining and fine-tuning.
Contribution
The paper proposes a general hypergraph encoding framework for structured data, enabling LLMs to better understand and utilize various structured formats across different models.
Findings
Pretrained hypergraph encoder improves LLM performance on SKG tasks.
LLaSA outperforms previous state-of-the-art methods with LoRA fine-tuning.
Unified hypergraph representation enables flexible handling of diverse structured data.
Abstract
Structured data, such as tables, graphs, and databases, play a critical role in plentiful NLP tasks such as question answering and dialogue system. Recently, inspired by Vision-Language Models, Graph Neutral Networks (GNNs) have been introduced as an additional modality into the input of Large Language Models (LLMs) to improve their performance on Structured Knowledge Grounding (SKG) tasks. However, those GNN-enhanced LLMs have the following limitations: (1) They employ diverse GNNs to model varying types of structured data, rendering them unable to uniformly process various forms of structured data. (2) The pretraining of GNNs is coupled with specific LLMs, which prevents GNNs from fully aligning with the textual space and limits their adaptability to other LLMs. To address these issues, we propose \textbf{L}arge \textbf{L}anguage and \textbf{S}tructured Data \textbf{A}ssistant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSemantic Web and Ontologies · Data Mining Algorithms and Applications · Natural Language Processing Techniques
