InstructGraph: Boosting Large Language Models via Graph-centric   Instruction Tuning and Preference Alignment

Jianing Wang; Junda Wu; Yupeng Hou; Yao Liu; Ming Gao; Julian McAuley

arXiv:2402.08785·cs.CL·February 15, 2024·1 cites

InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment

Jianing Wang, Junda Wu, Yupeng Hou, Yao Liu, Ming Gao, Julian McAuley

PDF

Open Access 1 Repo 1 Video

TL;DR

InstructGraph is a novel framework that enhances large language models' ability to perform graph reasoning and generation through instruction tuning, a universal graph data format, and preference alignment, significantly outperforming existing models.

Contribution

The paper introduces a unified graph verbalizer, a dedicated instruction tuning process, and a preference alignment strategy to improve LLMs' graph reasoning capabilities.

Findings

01

InstructGraph outperforms GPT-4 and LLaMA2 by over 13% and 38%.

02

The universal code-like format simplifies graph data representation.

03

Preference alignment reduces hallucinations in graph tasks.

Abstract

Do current large language models (LLMs) better solve graph reasoning and generation tasks with parameter updates? In this paper, we propose InstructGraph, a framework that empowers LLMs with the abilities of graph reasoning and generation by instruction tuning and preference alignment. Specifically, we first propose a structured format verbalizer to unify all graph data into a universal code-like format, which can simply represent the graph without any external graph-specific encoders. Furthermore, a graph instruction tuning stage is introduced to guide LLMs in solving graph reasoning and generation tasks. Finally, we identify potential hallucination problems in graph tasks and sample negative instances for preference alignment, the target of which is to enhance the output's reliability of the model. Extensive experiments across multiple graph-centric tasks exhibit that InstructGraph…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wjn1996/instructgraph
pytorchOfficial

Videos

InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks

MethodsPosition-Wise Feed-Forward Layer · Attention Is All You Need · Dropout · Linear Layer · Dense Connections · Label Smoothing · Absolute Position Encodings · Softmax · Byte Pair Encoding · Multi-Head Attention