Learning to Seek Help: Dynamic Collaboration Between Small and Large Language Models

Hang Zeng; Xiangyu Liu; Yong Hu; Chaoyue Niu; Jiarui Zhang; Shaojie Tang; Fan Wu; Guihai Chen

arXiv:2604.17827·cs.CL·April 21, 2026

Learning to Seek Help: Dynamic Collaboration Between Small and Large Language Models

Hang Zeng, Xiangyu Liu, Yong Hu, Chaoyue Niu, Jiarui Zhang, Shaojie Tang, Fan Wu, Guihai Chen

PDF

TL;DR

This paper proposes a dynamic collaboration framework where small and large language models work together, with the SLM learning to request help from the LLM during reasoning, improving efficiency and privacy.

Contribution

It introduces a novel adaptive collaboration strategy enabling SLMs to proactively seek LLM assistance, outperforming static methods and generalizing to unseen models.

Findings

01

Stronger SLMs become more self-reliant.

02

Stronger LLMs enable fewer, more informative interactions.

03

Dynamic strategies outperform static pipelines and transfer well.

Abstract

Large language models (LLMs) offer strong capabilities but raise cost and privacy concerns, whereas small language models (SLMs) facilitate efficient and private local inference yet suffer from limited capacity. To synergize the complementary strengths, we introduce a dynamic collaboration framework, where an SLM learns to proactively decide how to request an LLM during multi-step reasoning, while the LLM provides adaptive feedback instead of acting as a passive tool. We further systematically investigate how collaboration strategies are shaped by SLM and LLM capabilities as well as efficiency and privacy constraints. Evaluation results reveal a distinct scaling effect: stronger SLMs become more self-reliant, while stronger LLMs enable fewer and more informative interactions. In addition, the learned dynamic collaboration strategies significantly outperform static pipelines and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.