Supervised Knowledge Makes Large Language Models Better In-context   Learners

Linyi Yang; Shuibai Zhang; Zhuohao Yu; Guangsheng Bao; Yidong Wang,; Jindong Wang; Ruochen Xu; Wei Ye; Xing Xie; Weizhu Chen; Yue Zhang

arXiv:2312.15918·cs.CL·April 12, 2024·2 cites

Supervised Knowledge Makes Large Language Models Better In-context Learners

Linyi Yang, Shuibai Zhang, Zhuohao Yu, Guangsheng Bao, Yidong Wang,, Jindong Wang, Ruochen Xu, Wei Ye, Xing Xie, Weizhu Chen, Yue Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a simple framework that uses supervised knowledge to improve large language models' in-context learning, enhancing their generalizability, factuality, and reducing hallucinations during inference.

Contribution

The authors propose a plug-in method leveraging task-specific fine-tuned models to boost LLMs' reliability and performance in out-of-distribution scenarios and factual accuracy.

Findings

01

Enhanced Llama 2 and ChatGPT outperform original models in generalizability.

02

The method reduces hallucinations in generative tasks.

03

Incorporating discriminative models benefits LLMs' in-context learning.

Abstract

Large Language Models (LLMs) exhibit emerging in-context learning abilities through prompt engineering. The recent progress in large-scale generative models has further expanded their use in real-world language applications. However, the critical challenge of improving the generalizability and factuality of LLMs in natural language understanding and question answering remains under-explored. While previous in-context learning research has focused on enhancing models to adhere to users' specific instructions and quality expectations, and to avoid undesired outputs, little to no work has explored the use of task-Specific fine-tuned Language Models (SLMs) to improve LLMs' in-context learning during the inference stage. Our primary contribution is the establishment of a simple yet effective framework that enhances the reliability of LLMs as it: 1) generalizes out-of-distribution data, 2)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yanglinyi/supervised-knowledge-makes-large-language-models-better-in-context-learners
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification