Leveraging Open Knowledge for Advancing Task Expertise in Large Language   Models

Yuncheng Yang; Yulei Qin; Tong Wu; Zihan Xu; Gang Li; Pengcheng Guo,; Hang Shao; Yuchen Shi; Ke Li; Xing Sun; Jie Yang; Yun Gu

arXiv:2408.15915·cs.CV·September 10, 2024

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

Yuncheng Yang, Yulei Qin, Tong Wu, Zihan Xu, Gang Li, Pengcheng Guo,, Hang Shao, Yuchen Shi, Ke Li, Xing Sun, Jie Yang, Yun Gu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a scalable pipeline that leverages open knowledge and minimal human-annotated samples to enhance task-specific expertise in large language models through a mixture-of-experts system.

Contribution

It proposes a novel, cost-efficient method combining K-shot samples and diversity principles to select and fine-tune experts using open knowledge for domain-specific tasks.

Findings

01

Outperforms existing methods in task expertise enhancement

02

Effectively utilizes open knowledge with minimal human annotation

03

Demonstrates superior performance across various tasks

Abstract

The cultivation of expertise for large language models (LLMs) to solve tasks of specific areas often requires special-purpose tuning with calibrated behaviors on the expected stable outputs. To avoid huge cost brought by manual preparation of instruction datasets and training resources up to hundreds of hours, the exploitation of open knowledge including a wealth of low rank adaptation (LoRA) models and instruction datasets serves as a good starting point. However, existing methods on model and data selection focus on the performance of general-purpose capabilities while neglecting the knowledge gap exposed in domain-specific deployment. In the present study, we propose to bridge such gap by introducing few human-annotated samples (i.e., K-shot) for advancing task expertise of LLMs with open knowledge. Specifically, we develop an efficient and scalable pipeline to cost-efficiently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yaphabates/rocket
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsMixture of Experts · Focus