TEGEE: Task dEfinition Guided Expert Ensembling for Generalizable and   Few-shot Learning

Xingwei Qu; Yiming Liang; Yucheng Wang; Tianyu Zheng; Tommy Yue,; Xingyuan Bu; Lei Ma; Stephen W. Huang; Jiajun Zhang; Yinan Shi; Chenghua Lin,; Jie Fu; Ge Zhang

arXiv:2403.04233·cs.CL·December 17, 2024·2 cites

TEGEE: Task dEfinition Guided Expert Ensembling for Generalizable and Few-shot Learning

Xingwei Qu, Yiming Liang, Yucheng Wang, Tianyu Zheng, Tommy Yue,, Xingyuan Bu, Lei Ma, Stephen W. Huang, Jiajun Zhang, Yinan Shi, Chenghua Lin,, Jie Fu, Ge Zhang

PDF

Open Access

TL;DR

TEGEE introduces a modular framework that explicitly extracts task definitions to improve few-shot and many-shot learning in LLMs, outperforming traditional in-context learning methods.

Contribution

The paper proposes TEGEE, a novel task definition guided ensembling method that emphasizes explicit task extraction and extends ICL to support unlimited demonstrations.

Findings

01

TEGEE performs comparably to larger models like LLaMA2-13B.

02

The modular design enhances continual learning and scalability.

03

Explicit task definition extraction is more crucial than task processing.

Abstract

Large Language Models (LLMs) exhibit the ability to perform in-context learning (ICL), where they acquire new tasks directly from examples provided in demonstrations. This process is thought to operate through an implicit task selection mechanism that involves extracting and processing task definitions from these demonstrations. However, critical questions remain: Which is more essential -- task extraction or definition? And how can these capabilities be further improved? To address these questions, we propose \textbf{TEGEE} (Task Definition Guided Expert Ensembling), a method that explicitly extracts task definitions and generates responses based on specific tasks. Our framework employs a dual 3B model approach, with each model assigned a distinct role: one focuses on task definition extraction, while the other handles learning from demonstrations. This modular approach supports the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification