Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering
Zhihua Wen, Zhiliang Tian, Zexin Jian, Zhen Huang, Pei Ke, Yifu Gao,, Minlie Huang, Dongsheng Li

TL;DR
This paper investigates the knowledge boundary of large language models using semi-open-ended questions, revealing their limitations and proposing a method to better perceive their knowledge scope and ambiguous answers.
Contribution
It introduces a novel approach to perceive LLMs' knowledge boundary with semi-open-ended questions by discovering ambiguous answers using an auxiliary model.
Findings
GPT-4 performs poorly on semi-open-ended questions.
LLaMA-2-13B effectively discovers ambiguous answers.
The method constructs a dataset to evaluate LLMs' knowledge boundary.
Abstract
Large Language Models (LLMs) are widely used for knowledge-seeking yet suffer from hallucinations. The knowledge boundary (KB) of an LLM limits its factual understanding, beyond which it may begin to hallucinate. Investigating the perception of LLMs' KB is crucial for detecting hallucinations and LLMs' reliable generation. Current studies perceive LLMs' KB on questions with a concrete answer (close-ended questions) while paying limited attention to semi-open-ended questions (SoeQ) that correspond to many potential answers. Some researchers achieve it by judging whether the question is answerable or not. However, this paradigm is unsuitable for SoeQ, which are usually partially answerable, containing both answerable and ambiguous (unanswerable) answers. Ambiguous answers are essential for knowledge-seeking, but they may go beyond the KB of LLMs. In this paper, we perceive the LLMs' KB…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Label Smoothing · Adam · Absolute Position Encodings · Dropout
