Know the Unknown: An Uncertainty-Sensitive Method for LLM Instruction Tuning
Jiaqi Li, Yixuan Tang, Yi Yang

TL;DR
This paper introduces US-Tuning, a two-stage method that improves large language models' ability to recognize their knowledge limits and reduces hallucinations, leading to more accurate and faithful responses.
Contribution
The paper presents US-Tuning, a novel two-stage instruction tuning approach that enhances LLMs' awareness of their knowledge boundaries and adherence to instructions.
Findings
US-Tuning significantly reduces incorrect answers in contextual QA.
Fine-tuned Llama2-7B improves out-of-knowledge question handling by 34.7%.
Model outperforms GPT-4 by 4.2% in overall performance.
Abstract
Large language models (LLMs) demonstrate remarkable capabilities but face challenges from hallucinations, which typically arise from insufficient knowledge or context. While instructing LLMs to acknowledge knowledge limitations by responding with "I don't know" appears promising, we find that models consistently struggle with admitting knowledge gaps. This challenge may originate from current instruction datasets that emphasise answer generation over knowledge boundary awareness. To address this limitation, we introduce Uncertainty-and-Sensitivity-Aware Tuning (US-Tuning), a novel two-stage approach for contextual question answering (QA). The first stage enhances LLMs' ability to recognise their knowledge boundaries, while the second stage reinforces instruction adherence through carefully designed causal prompts. Our experimental results demonstrate that US-Tuning not only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Processing Techniques · Natural Language Processing Techniques · Experimental Learning in Engineering
MethodsAttention Is All You Need · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer
