GraphWiz: An Instruction-Following Language Model for Graph Problems

Nuo Chen; Yuhan Li; Jianheng Tang; Jia Li

arXiv:2402.16029·cs.CL·July 4, 2024·1 cites

GraphWiz: An Instruction-Following Language Model for Graph Problems

Nuo Chen, Yuhan Li, Jianheng Tang, Jia Li

PDF

Open Access 1 Repo 8 Models

TL;DR

GraphWiz is a specialized language model trained with a new dataset to solve diverse graph problems with explicit reasoning, outperforming GPT-4 and providing insights into training data effects and transferability.

Contribution

We introduce GraphInstruct, a comprehensive instruction dataset, and develop GraphWiz, an open-source model capable of explicit graph problem reasoning, enhanced with DPO for improved accuracy.

Findings

01

GraphWiz achieves 65% accuracy across nine graph tasks.

02

GraphWiz surpasses GPT-4's average accuracy of 43.8%.

03

Training data volume impacts model performance and overfitting.

Abstract

Large language models (LLMs) have achieved impressive success across several fields, but their proficiency in understanding and resolving complex graph problems is less explored. To bridge this gap, we introduce GraphInstruct, a novel and comprehensive instruction-tuning dataset designed to equip language models with the ability to tackle a broad spectrum of graph problems using explicit reasoning paths. Utilizing GraphInstruct, we build GraphWiz, an open-source language model capable of resolving various graph problem types while generating clear reasoning processes. To enhance the model's capability and reliability, we incorporate the Direct Preference Optimization (DPO) framework into the graph problem-solving context. The enhanced model, GraphWiz-DPO, achieves an average accuracy of 65% across nine tasks with different complexity levels, surpassing GPT-4 which has an average…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nuochenpku/Graph-Reasoning-LLM
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Natural Language Processing Techniques · Topic Modeling

MethodsLinear Layer · Dropout · Layer Normalization · Byte Pair Encoding · Multi-Head Attention · Dense Connections · Label Smoothing · Adam · Attention Is All You Need · Softmax