Federated Reasoning Distillation Framework with Model Learnability-Aware Data Allocation

Wei Guo; Siyuan Lu; Xiangdong Ran; Yiqi Tong; Yikun Ban; Zelong Xu; Jing Fan; Zixuan Huang; Xiao Zhang; Zhaojun Hu; Fuzhen Zhuang

arXiv:2602.18749 · cs.AI · February 24, 2026

TL;DR

This paper introduces LaDa, a federated reasoning distillation framework that adaptively allocates data based on model learnability gaps and employs domain adaptive distillation to enhance reasoning transfer between large and small language models.

Contribution

It proposes a novel learnability-aware data allocation method and a domain adaptive reasoning distillation technique for improved federated LLM and SLM collaboration.

Findings

01 Effective bidirectional knowledge transfer facilitated
02 Enhanced reasoning pattern acquisition in SLMs
03 Flexible adaptation to local data domains achieved

Abstract

Data allocation plays a critical role in federated reasoning collaboration between a large language model (LLM) and small language models (SLMs). Nevertheless, existing data allocation methods fail to address an under-explored challenge in this collaboration: the bidirectional model learnability gap, where client-side SLMs cannot identify high-reward samples matching their learnability constraints for effective knowledge transfer from LLMs, while LLMs struggle to select samples contributing novel knowledge beyond their existing data. Furthermore, these collaboration frameworks face another key challenge: domain-agnostic reasoning transfer, where existing reasoning transfer methods fail to flexibly adapt to the local domain data, preventing SLMs from effectively acquiring step-by-step reasoning abilities from a general LLM. To address these challenges, we propose LaDa, a federated reasoning…

Taxonomy

Topics: Topic Modeling · Advanced Graph Neural Networks · Natural Language Processing Techniques