MAGDi: Structured Distillation of Multi-Agent Interaction Graphs   Improves Reasoning in Smaller Language Models

Justin Chih-Yao Chen; Swarnadeep Saha; Elias Stengel-Eskin; Mohit; Bansal

arXiv:2402.01620·cs.CL·June 11, 2024·1 cites

MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models

Justin Chih-Yao Chen, Swarnadeep Saha, Elias Stengel-Eskin, Mohit, Bansal

PDF

Open Access 1 Repo

TL;DR

MAGDi is a structured distillation method that encodes multi-agent reasoning interactions as graphs, enabling smaller language models to improve reasoning performance efficiently and generalize better across tasks.

Contribution

This paper introduces MAGDi, a novel graph-based distillation approach that captures multi-agent interactions, enhancing smaller models' reasoning abilities beyond existing single-teacher methods.

Findings

01

MAGDi outperforms single and multi-teacher distillation methods on reasoning benchmarks.

02

It significantly improves out-of-domain generalization.

03

MAGDi achieves higher efficiency compared to its teacher models.

Abstract

Multi-agent interactions between Large Language Model (LLM) agents have shown major improvements on diverse reasoning tasks. However, these involve long generations from multiple models across several rounds, making them expensive. Moreover, these multi-agent approaches fail to provide a final, single model for efficient inference. To address this, we introduce MAGDi, a new method for structured distillation of the reasoning interactions between multiple LLMs into smaller LMs. MAGDi teaches smaller models by representing multi-agent interactions as graphs, augmenting a base student model with a graph encoder, and distilling knowledge using three objective functions: next-token prediction, a contrastive loss between correct and incorrect reasoning, and a graph-based objective to model the interaction structure. Experiments on seven widely used commonsense and math reasoning benchmarks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dinobby/magdi
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies

MethodsBalanced Selection