Knowledgeable In-Context Tuning: Exploring and Exploiting Factual   Knowledge for In-Context Learning

Jianing Wang; Chengyu Wang; Chuanqi Tan; Jun Huang; Ming Gao

arXiv:2309.14771·cs.CL·April 2, 2024

Knowledgeable In-Context Tuning: Exploring and Exploiting Factual Knowledge for In-Context Learning

Jianing Wang, Chengyu Wang, Chuanqi Tan, Jun Huang, Ming Gao

PDF

Open Access

TL;DR

This paper introduces Knowledgeable In-Context Tuning (KICT), a framework that enhances large language models' in-context learning by integrating factual knowledge, selecting relevant examples, and calibrating outputs, leading to significant performance gains.

Contribution

The paper proposes a novel KICT framework that injects knowledge during pre-training, selects knowledge-relevant examples, and calibrates predictions, improving ICL performance in LLMs.

Findings

01

KICT outperforms strong baselines on multiple tasks.

02

Achieves over 13% improvement in text classification.

03

Achieves over 7% improvement in question-answering.

Abstract

Large language models (LLMs) enable in-context learning (ICL) by conditioning on a few labeled training examples as a text-based prompt, eliminating the need for parameter updates and achieving competitive performance. In this paper, we demonstrate that factual knowledge is imperative for the performance of ICL in three core facets: the inherent knowledge learned in LLMs, the factual knowledge derived from the selected in-context examples, and the knowledge biases in LLMs for output generation. To unleash the power of LLMs in few-shot learning scenarios, we introduce a novel Knowledgeable In-Context Tuning (KICT) framework to further improve the performance of ICL: 1) injecting knowledge into LLMs during continual self-supervised pre-training, 2) judiciously selecting the examples for ICL with high knowledge relevance, and 3) calibrating the prediction results based on prior knowledge.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning