SLearnLLM: A Self-Learning Framework for Efficient Domain-Specific Adaptation of Large Language Models

Xiang Liu; Zhaoxiang Liu; Peng Wang; Kohou Wang; Huan Hu; Kai Wang; and Shiguo Lian

arXiv:2505.17470·cs.CL·May 26, 2025

SLearnLLM: A Self-Learning Framework for Efficient Domain-Specific Adaptation of Large Language Models

Xiang Liu, Zhaoxiang Liu, Peng Wang, Kohou Wang, Huan Hu, Kai Wang, and Shiguo Lian

PDF

TL;DR

This paper introduces a self-learning framework for domain-specific adaptation of large language models that improves training efficiency by focusing on unknown knowledge within the dataset, reducing training time while maintaining performance.

Contribution

The proposed framework enables LLMs to identify and focus on unknown knowledge in the dataset, significantly reducing training time without sacrificing accuracy.

Findings

01

Reduces fine-tuning training time in agriculture and medicine domains.

02

Achieves comparable performance to full dataset fine-tuning.

03

Enhances training efficiency by filtering out known information.

Abstract

When using supervised fine-tuning (SFT) to adapt large language models (LLMs) to specific domains, a significant challenge arises: should we use the entire SFT dataset for fine-tuning? Common practice often involves fine-tuning directly on the entire dataset due to limited information on the LLM's past training data. However, if the SFT dataset largely overlaps with the model's existing knowledge, the performance gains are minimal, leading to wasted computational resources. Identifying the unknown knowledge within the SFT dataset and using it to fine-tune the model could substantially improve the training efficiency. To address this challenge, we propose a self-learning framework for LLMs inspired by human learning pattern. This framework takes a fine-tuning (SFT) dataset in a specific domain as input. First, the LLMs answer the questions in the SFT dataset. The LLMs then objectively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.