Iterative Graph Alignment

Fangyuan Yu; Hardeep Singh Arora; Matt Johnson

arXiv:2408.16667·cs.LG·August 30, 2024

Iterative Graph Alignment

Fangyuan Yu, Hardeep Singh Arora, Matt Johnson

PDF

Open Access 1 Repo

TL;DR

The paper introduces Iterative Graph Alignment (IGA), a scalable, annotation-free rule-based alignment method for large language models, improving their ability to follow rules through iterative supervised fine-tuning.

Contribution

IGA is a novel, annotation-free alignment algorithm that uses iterative graph prompting and collaborative fine-tuning to enhance rule adherence in LLMs.

Findings

01

73.12% alignment improvement in Claude Sonnet 3.5

02

86.20% alignment improvement in Llama3-8B-Instruct

03

Outperforms existing rule-based alignment methods

Abstract

By compressing diverse narratives, LLMs go beyond memorization, achieving intelligence by capturing generalizable causal relationships. However, they suffer from local 'representation gaps' due to insufficient training data diversity, limiting their real-world utility, especially in tasks requiring strict alignment to rules. Traditional alignment methods relying on heavy human annotations are inefficient and unscalable. Recent self-alignment techniques also fall short, as they often depend on self-selection based prompting and memorization-based learning. To address these issues, we introduce Iterative Graph Alignment (IGA), an annotation-free rule-based alignment algorithm. A teacher model (VLM) employs Iterative Graph Prompting (IGP) to create logical graphs and reference answers. The student model (LLM) identifies local knowledge gaps by attempting to align its responses with these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fangyuan-ksgk/ruleeval
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGraph Theory and Algorithms · Advanced Graph Neural Networks · Semantic Web and Ontologies

MethodsALIGN