Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models
Yuncheng Yang, Yulei Qin, Tong Wu, Zihan Xu, Gang Li, Pengcheng Guo,, Hang Shao, Yuchen Shi, Ke Li, Xing Sun, Jie Yang, Yun Gu

TL;DR
This paper introduces a scalable pipeline that leverages open knowledge and minimal human-annotated samples to enhance task-specific expertise in large language models through a mixture-of-experts system.
Contribution
It proposes a novel, cost-efficient method combining K-shot samples and diversity principles to select and fine-tune experts using open knowledge for domain-specific tasks.
Findings
Outperforms existing methods in task expertise enhancement
Effectively utilizes open knowledge with minimal human annotation
Demonstrates superior performance across various tasks
Abstract
The cultivation of expertise for large language models (LLMs) to solve tasks of specific areas often requires special-purpose tuning with calibrated behaviors on the expected stable outputs. To avoid huge cost brought by manual preparation of instruction datasets and training resources up to hundreds of hours, the exploitation of open knowledge including a wealth of low rank adaptation (LoRA) models and instruction datasets serves as a good starting point. However, existing methods on model and data selection focus on the performance of general-purpose capabilities while neglecting the knowledge gap exposed in domain-specific deployment. In the present study, we propose to bridge such gap by introducing few human-annotated samples (i.e., K-shot) for advancing task expertise of LLMs with open knowledge. Specifically, we develop an efficient and scalable pipeline to cost-efficiently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsMixture of Experts · Focus
