Let Your Graph Do the Talking: Encoding Structured Data for LLMs

Bryan Perozzi; Bahare Fatemi; Dustin Zelle; Anton Tsitsulin; Mehran; Kazemi; Rami Al-Rfou; Jonathan Halcrow

arXiv:2402.05862·cs.LG·February 9, 2024·6 cites

Let Your Graph Do the Talking: Encoding Structured Data for LLMs

Bryan Perozzi, Bahare Fatemi, Dustin Zelle, Anton Tsitsulin, Mehran, Kazemi, Rami Al-Rfou, Jonathan Halcrow

PDF

Open Access 2 Repos

TL;DR

This paper presents GraphToken, a parameter-efficient method for encoding structured data into prompts for large language models, significantly improving reasoning performance across various graph tasks.

Contribution

Introducing GraphToken, the first general encoding method for structured data that enhances LLM reasoning across multiple graph-based tasks.

Findings

01

Up to 73% improvement on graph reasoning tasks

02

Explicit graph encoding enhances LLM performance

03

Applicable across node, edge, and graph-level tasks

Abstract

How can we best encode structured data into sequential form for use in large language models (LLMs)? In this work, we introduce a parameter-efficient method to explicitly represent structured data for LLMs. Our method, GraphToken, learns an encoding function to extend prompts with explicit structured information. Unlike other work which focuses on limited domains (e.g. knowledge graph representation), our work is the first effort focused on the general encoding of structured data to be used for various reasoning tasks. We show that explicitly representing the graph structure allows significant improvements to graph reasoning tasks. Specifically, we see across the board improvements - up to 73% points - on node, edge and, graph-level tasks from the GraphQA benchmark.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Biomedical Text Mining and Ontologies · Scientific Computing and Data Management