Learnware of Language Models: Specialized Small Language Models Can Do Big
Zhi-Hao Tan, Zi-Chen Zhao, Hao-Yu Shi, Xin-Yu Zhang, Peng Tan, Yang Yu, Zhi-Hua Zhou

TL;DR
This paper explores applying the learnware paradigm to language models, demonstrating that specialized small language models can outperform larger models in domain-specific tasks through selective reuse.
Contribution
It introduces a learnware system of specialized SLMs for language tasks, showing effective reuse and superior performance over larger models in specific domains.
Findings
System outperforms base SLMs on all benchmarks.
Achieves at least 14% improvement over larger LLMs in finance and medical tasks.
Demonstrates effective domain-specific model reuse without exposing data.
Abstract
The learnware paradigm offers a novel approach to machine learning by enabling users to reuse a set of well-trained models for tasks beyond the models' original purposes. It eliminates the need to build models from scratch, instead relying on specifications (representations of a model's capabilities) to identify and leverage the most suitable models for new tasks. While learnware has proven effective in many scenarios, its application to language models has remained largely unexplored. At the same time, large language models (LLMs) have demonstrated remarkable universal question-answering abilities, yet they face challenges in specialized scenarios due to data scarcity, privacy concerns, and high computational costs, thus more and more specialized small language models (SLMs) are being trained for specific domains. To address these limitations systematically, the learnware paradigm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Machine Learning in Healthcare · Topic Modeling
MethodsSparse Evolutionary Training · Balanced Selection
