Learnware of Language Models: Specialized Small Language Models Can Do Big

Zhi-Hao Tan; Zi-Chen Zhao; Hao-Yu Shi; Xin-Yu Zhang; Peng Tan; Yang Yu; Zhi-Hua Zhou

arXiv:2505.13425·cs.LG·May 20, 2025

Learnware of Language Models: Specialized Small Language Models Can Do Big

Zhi-Hao Tan, Zi-Chen Zhao, Hao-Yu Shi, Xin-Yu Zhang, Peng Tan, Yang Yu, Zhi-Hua Zhou

PDF

Open Access 1 Repo

TL;DR

This paper explores applying the learnware paradigm to language models, demonstrating that specialized small language models can outperform larger models in domain-specific tasks through selective reuse.

Contribution

It introduces a learnware system of specialized SLMs for language tasks, showing effective reuse and superior performance over larger models in specific domains.

Findings

01

System outperforms base SLMs on all benchmarks.

02

Achieves at least 14% improvement over larger LLMs in finance and medical tasks.

03

Demonstrates effective domain-specific model reuse without exposing data.

Abstract

The learnware paradigm offers a novel approach to machine learning by enabling users to reuse a set of well-trained models for tasks beyond the models' original purposes. It eliminates the need to build models from scratch, instead relying on specifications (representations of a model's capabilities) to identify and leverage the most suitable models for new tasks. While learnware has proven effective in many scenarios, its application to language models has remained largely unexplored. At the same time, large language models (LLMs) have demonstrated remarkable universal question-answering abilities, yet they face challenges in specialized scenarios due to data scarcity, privacy concerns, and high computational costs, thus more and more specialized small language models (SLMs) are being trained for specific domains. To address these limitations systematically, the learnware paradigm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

learnware-lamda/learnware
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Machine Learning in Healthcare · Topic Modeling

MethodsSparse Evolutionary Training · Balanced Selection