Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models

Alfonso Amayuelas; Kyle Wong; Liangming Pan; Wenhu Chen; William Wang

arXiv:2305.13712 · cs.CL · July 3, 2024 · 1 citation

TL;DR

This paper explores how large language models understand and articulate their uncertainty regarding known-unknown questions, introducing a new dataset and demonstrating improved performance in uncertainty detection and expression.

Contribution

The paper introduces a new dataset of Known-Unknown Questions (KUQ), a categorization framework for the sources of uncertainty in such questions, and fine-tuning on that dataset to improve LLMs' ability to identify and express uncertainty.

Findings

01. Fine-tuned models show significant F1-score improvements in uncertainty detection.

02. Models better distinguish between known and unknown questions after fine-tuning.

03. Enhanced uncertainty articulation improves multi-agent debate performance.
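
The first finding is stated in terms of F1-score for telling unknown questions apart from known ones. As a minimal sketch of that metric, assuming "unknown" is treated as the positive class and labels are plain booleans (the function name `f1_unknown` and the toy labels below are illustrative only, not taken from the paper or its repository):

```python
from typing import Sequence


def f1_unknown(gold: Sequence[bool], pred: Sequence[bool]) -> float:
    """F1 for the positive class; True means 'unknown' (an assumption of this sketch)."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    pairs = list(zip(gold, pred))
    tp = sum(1 for g, p in pairs if g and p)        # unknown, flagged as unknown
    fp = sum(1 for g, p in pairs if not g and p)    # known, wrongly flagged as unknown
    fn = sum(1 for g, p in pairs if g and not p)    # unknown, missed
    if tp == 0:
        return 0.0  # precision or recall is zero or undefined; report 0 by convention
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels, invented for illustration: True = unknown question.
gold = [True, True, False, False, True]
pred = [True, False, False, True, True]
score = f1_unknown(gold, pred)
print(f"F1 (unknown class): {score:.3f}")
```

Comparing this score for the same model before and after fine-tuning is one way to read the reported improvement; the paper's exact evaluation protocol may differ.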

Abstract

This paper investigates the capabilities of Large Language Models (LLMs) in the context of understanding their knowledge and uncertainty over questions. Specifically, we focus on addressing known-unknown questions, characterized by high uncertainty due to the absence of definitive answers. To facilitate our study, we collect a new dataset with Known-Unknown Questions (KUQ) and establish a categorization framework to clarify the origins of uncertainty in such queries. Subsequently, we examine the performance of open-source LLMs, fine-tuned using this dataset, in distinguishing between known and unknown queries within open-ended question-answering scenarios. The fine-tuned models demonstrated a significant improvement, achieving a considerable increase in F1-score relative to their pre-fine-tuning state. Through a comprehensive analysis, we reveal insights into the models' improved…

Code & Models

Repositories

amayuelas/knowledge-of-knowledge
PyTorch · Official

Datasets

amayuelas/KUQ
Dataset · 1.7k downloads

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Text Readability and Simplification