GraphWiz: An Instruction-Following Language Model for Graph Problems
Nuo Chen, Yuhan Li, Jianheng Tang, Jia Li

TL;DR
GraphWiz is a specialized language model trained with a new dataset to solve diverse graph problems with explicit reasoning, outperforming GPT-4 and providing insights into training data effects and transferability.
Contribution
We introduce GraphInstruct, a comprehensive instruction dataset, and develop GraphWiz, an open-source model capable of explicit graph problem reasoning, enhanced with DPO for improved accuracy.
Findings
GraphWiz achieves 65% accuracy across nine graph tasks.
GraphWiz surpasses GPT-4's average accuracy of 43.8%.
Training data volume impacts model performance and overfitting.
Abstract
Large language models (LLMs) have achieved impressive success across several fields, but their proficiency in understanding and resolving complex graph problems is less explored. To bridge this gap, we introduce GraphInstruct, a novel and comprehensive instruction-tuning dataset designed to equip language models with the ability to tackle a broad spectrum of graph problems using explicit reasoning paths. Utilizing GraphInstruct, we build GraphWiz, an open-source language model capable of resolving various graph problem types while generating clear reasoning processes. To enhance the model's capability and reliability, we incorporate the Direct Preference Optimization (DPO) framework into the graph problem-solving context. The enhanced model, GraphWiz-DPO, achieves an average accuracy of 65% across nine tasks with different complexity levels, surpassing GPT-4 which has an average…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗GraphWiz/LLaMA2-7Bmodel· 188 dl· ♡ 1188 dl♡ 1
- 🤗GraphWiz/Mistral-7Bmodel· 15 dl15 dl
- 🤗GraphWiz/Mistral-7B-RFTmodel· 9 dl9 dl
- 🤗GraphWiz/LLaMA2-7B-RFTmodel· 14 dl· ♡ 114 dl♡ 1
- 🤗GraphWiz/LLaMA2-7B-DPOmodel· 152 dl152 dl
- 🤗GraphWiz/LLaMA2-13Bmodel· 145 dl145 dl
- 🤗GraphWiz/LLaMA2-13B-RFTmodel· 10 dl· ♡ 210 dl♡ 2
- 🤗GraphWiz/LLaMA2-13B-DPOmodel· 7 dl7 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Natural Language Processing Techniques · Topic Modeling
MethodsLinear Layer · Dropout · Layer Normalization · Byte Pair Encoding · Multi-Head Attention · Dense Connections · Label Smoothing · Adam · Attention Is All You Need · Softmax
