FedCoT: Federated Chain-of-Thought Distillation for Large Language Models

Tao Fan; Weijing Chen; Yan Kang; Guoqiang Ma; Hanlin Gu; Yuanfeng Song; Lixin Fan; Qiang Yang

arXiv:2406.12403·cs.CL·November 11, 2025·1 cites

FedCoT: Federated Chain-of-Thought Distillation for Large Language Models

Tao Fan, Weijing Chen, Yan Kang, Guoqiang Ma, Hanlin Gu, Yuanfeng Song, Lixin Fan, Qiang Yang

PDF

Open Access

TL;DR

FedCoT introduces a federated framework for transferring knowledge from large to small language models using Chain-of-Thought distillation, ensuring data privacy and improved performance in resource-constrained environments.

Contribution

It presents a novel federated approach with privacy-preserving strategies for Chain-of-Thought distillation from LLMs to SLMs, addressing privacy and efficiency challenges.

Findings

01

Enhanced SLM performance on text generation tasks

02

Effective privacy protection via proposed mechanisms

03

Feasible federated knowledge transfer in resource-limited settings

Abstract

Large Language Models (LLMs) have emerged as a transformative force in artificial intelligence, demonstrating exceptional proficiency across various tasks. However, their deployment in resource-constrained environments and concerns over user data privacy pose significant challenges. In contrast, Small Language Models (SLMs) offer computational efficiency but often lag in performance. To address these issues, we propose FedCoT, a federated framework designed for the Chain-of-Thought (CoT) distillation of knowledge from LLMs to SLMs, while ensuring the preservation of clients' data privacy. FedCoT ensures secure and efficient knowledge transfer from an LLM on a high-powered server to an SLM on a resource-constrained client, while adhering to privacy requirements. Leveraging perturbed prompts and rationales generated through the CoT approach, the framework enhances the performance of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data