Distilling LLM Agent into Small Models with Retrieval and Code Tools

Minki Kang; Jongwon Jeong; Seanie Lee; Jaewoong Cho; Sung Ju Hwang

arXiv:2505.17612·cs.CL·November 6, 2025

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Minki Kang, Jongwon Jeong, Seanie Lee, Jaewoong Cho, Sung Ju Hwang

PDF

1 Repo 8 Models 4 Datasets 1 Video

TL;DR

This paper introduces Agent Distillation, a method to transfer full task-solving abilities from large language models to smaller models using retrieval and code tools, enhancing small models' reasoning and robustness.

Contribution

It proposes a novel agent distillation framework with new prompting and self-consistent action generation techniques for small models.

Findings

01

Small models (0.5B-3B) achieve performance comparable to larger models.

02

Enhanced robustness and accuracy in reasoning tasks.

03

Effective transfer of reasoning and task-solving capabilities.

Abstract

Large language models (LLMs) excel at complex reasoning tasks but remain computationally expensive, limiting their practical deployment. To address this, recent works have focused on distilling reasoning capabilities into smaller language models (sLMs) using chain-of-thought (CoT) traces from teacher LLMs. However, this approach struggles in scenarios requiring rare factual knowledge or precise computation, where sLMs often hallucinate due to limited capability. In this work, we propose Agent Distillation, a framework for transferring not only reasoning capability but full task-solving behavior from LLM-based agents into sLMs with retrieval and code tools. We improve agent distillation along two complementary axes: (1) we introduce a prompting method called first-thought prefix to enhance the quality of teacher-generated trajectories; and (2) we propose a self-consistent action…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nardien/agent-distillation
pytorchOfficial

Models

Datasets

Videos

Distilling LLM Agent into Small Models with Retrieval and Code Tools· slideslive