Sub-goal Distillation: A Method to Improve Small Language Agents
Maryam Hashemzadeh, Elias Stengel-Eskin, Sarath Chandar,, Marc-Alexandre Cote

TL;DR
This paper introduces Sub-goal Distillation, a hierarchical approach that transfers LLM capabilities to smaller models by learning sub-goals, enabling efficient, cost-effective performance in complex interactive tasks.
Contribution
The paper presents a novel hierarchical distillation method that enables small language models to perform complex tasks by learning sub-goals from large models, reducing reliance on costly LLM calls.
Findings
Achieved 16.7% improvement over imitation learning with elementary actions
Reduced LLM interaction costs to a fixed, minimal amount
Outperformed standard imitation learning in ScienceWorld environment
Abstract
While Large Language Models (LLMs) have demonstrated significant promise as agents in interactive tasks, their substantial computational requirements and restricted number of calls constrain their practical utility, especially in long-horizon interactive tasks such as decision-making or in scenarios involving continuous ongoing tasks. To address these constraints, we propose a method for transferring the performance of an LLM with billions of parameters to a much smaller language model (770M parameters). Our approach involves constructing a hierarchical agent comprising a planning module, which learns through Knowledge Distillation from an LLM to generate sub-goals, and an execution module, which learns to accomplish these sub-goals using elementary actions. In detail, we leverage an LLM to annotate an oracle path with a sequence of sub-goals towards completing a goal. Subsequently, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Natural Language Processing Techniques · Speech and dialogue systems
MethodsKnowledge Distillation
