Mutual Enhancement of Large and Small Language Models with Cross-Silo Knowledge Transfer
Yongheng Deng, Ziqing Qiao, Ju Ren, Yang Liu, Yaoxue Zhang

TL;DR
This paper introduces CrossLM, a method where large and small language models mutually improve each other through cross-silo knowledge transfer, enhancing task-specific performance without compromising generalization.
Contribution
The paper presents a novel framework enabling mutual enhancement of LLMs and SLMs via cross-silo knowledge transfer, addressing privacy and performance issues.
Findings
Significant performance improvements for SLMs on client tasks.
Enhanced task-specific accuracy for LLMs without losing generalization.
Effective mutual enhancement demonstrated across benchmark tasks.
Abstract
While large language models (LLMs) are empowered with broad knowledge, their task-specific performance is often suboptimal. It necessitates fine-tuning LLMs with task-specific data, but such data may be inaccessible due to privacy concerns. In this paper, we propose a novel approach to enhance LLMs with smaller language models (SLMs) that are trained on clients using their private task-specific data. To enable mutual enhancement between LLMs and SLMs, we propose CrossLM, where the SLMs promote the LLM to generate task-specific high-quality data, and both the LLM and SLMs are enhanced with the generated data. We evaluate CrossLM using publicly accessible language models across a range of benchmark tasks. The results demonstrate that CrossLM significantly enhances the task-specific performance of SLMs on clients and the LLM on the cloud server simultaneously while preserving the LLM's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Healthcare and Education
