Perception of Knowledge Boundary for Large Language Models through   Semi-open-ended Question Answering

Zhihua Wen; Zhiliang Tian; Zexin Jian; Zhen Huang; Pei Ke; Yifu Gao,; Minlie Huang; Dongsheng Li

arXiv:2405.14383·cs.CL·May 24, 2024

Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering

Zhihua Wen, Zhiliang Tian, Zexin Jian, Zhen Huang, Pei Ke, Yifu Gao,, Minlie Huang, Dongsheng Li

PDF

Open Access 1 Video

TL;DR

This paper investigates the knowledge boundary of large language models using semi-open-ended questions, revealing their limitations and proposing a method to better perceive their knowledge scope and ambiguous answers.

Contribution

It introduces a novel approach to perceive LLMs' knowledge boundary with semi-open-ended questions by discovering ambiguous answers using an auxiliary model.

Findings

01

GPT-4 performs poorly on semi-open-ended questions.

02

LLaMA-2-13B effectively discovers ambiguous answers.

03

The method constructs a dataset to evaluate LLMs' knowledge boundary.

Abstract

Large Language Models (LLMs) are widely used for knowledge-seeking yet suffer from hallucinations. The knowledge boundary (KB) of an LLM limits its factual understanding, beyond which it may begin to hallucinate. Investigating the perception of LLMs' KB is crucial for detecting hallucinations and LLMs' reliable generation. Current studies perceive LLMs' KB on questions with a concrete answer (close-ended questions) while paying limited attention to semi-open-ended questions (SoeQ) that correspond to many potential answers. Some researchers achieve it by judging whether the question is answerable or not. However, this paradigm is unsuitable for SoeQ, which are usually partially answerable, containing both answerable and ambiguous (unanswerable) answers. Ambiguous answers are essential for knowledge-seeking, but they may go beyond the KB of LLMs. In this paper, we perceive the LLMs' KB…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Label Smoothing · Adam · Absolute Position Encodings · Dropout