MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models
Justin Chih-Yao Chen, Swarnadeep Saha, Elias Stengel-Eskin, Mohit, Bansal

TL;DR
MAGDi is a structured distillation method that encodes multi-agent reasoning interactions as graphs, enabling smaller language models to improve reasoning performance efficiently and generalize better across tasks.
Contribution
This paper introduces MAGDi, a novel graph-based distillation approach that captures multi-agent interactions, enhancing smaller models' reasoning abilities beyond existing single-teacher methods.
Findings
MAGDi outperforms single and multi-teacher distillation methods on reasoning benchmarks.
It significantly improves out-of-domain generalization.
MAGDi achieves higher efficiency compared to its teacher models.
Abstract
Multi-agent interactions between Large Language Model (LLM) agents have shown major improvements on diverse reasoning tasks. However, these involve long generations from multiple models across several rounds, making them expensive. Moreover, these multi-agent approaches fail to provide a final, single model for efficient inference. To address this, we introduce MAGDi, a new method for structured distillation of the reasoning interactions between multiple LLMs into smaller LMs. MAGDi teaches smaller models by representing multi-agent interactions as graphs, augmenting a base student model with a graph encoder, and distilling knowledge using three objective functions: next-token prediction, a contrastive loss between correct and incorrect reasoning, and a graph-based objective to model the interaction structure. Experiments on seven widely used commonsense and math reasoning benchmarks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies
MethodsBalanced Selection
