Know the Unknown: An Uncertainty-Sensitive Method for LLM Instruction Tuning

Jiaqi Li; Yixuan Tang; Yi Yang

arXiv:2406.10099·cs.CL·August 8, 2025·1 cites

Know the Unknown: An Uncertainty-Sensitive Method for LLM Instruction Tuning

Jiaqi Li, Yixuan Tang, Yi Yang

PDF

Open Access 1 Repo 8 Models

TL;DR

This paper introduces US-Tuning, a two-stage method that improves large language models' ability to recognize their knowledge limits and reduces hallucinations, leading to more accurate and faithful responses.

Contribution

The paper presents US-Tuning, a novel two-stage instruction tuning approach that enhances LLMs' awareness of their knowledge boundaries and adherence to instructions.

Findings

01

US-Tuning significantly reduces incorrect answers in contextual QA.

02

Fine-tuned Llama2-7B improves out-of-knowledge question handling by 34.7%.

03

Model outperforms GPT-4 by 4.2% in overall performance.

Abstract

Large language models (LLMs) demonstrate remarkable capabilities but face challenges from hallucinations, which typically arise from insufficient knowledge or context. While instructing LLMs to acknowledge knowledge limitations by responding with "I don't know" appears promising, we find that models consistently struggle with admitting knowledge gaps. This challenge may originate from current instruction datasets that emphasise answer generation over knowledge boundary awareness. To address this limitation, we introduce Uncertainty-and-Sensitivity-Aware Tuning (US-Tuning), a novel two-stage approach for contextual question answering (QA). The first stage enhances LLMs' ability to recognise their knowledge boundaries, while the second stage reinforces instruction adherence through carefully designed causal prompts. Our experimental results demonstrate that US-Tuning not only…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jiaqili404/trustworthyrag
noneOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Processing Techniques · Natural Language Processing Techniques · Experimental Learning in Engineering

MethodsAttention Is All You Need · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer