Deep Learning-based Method for Expressing Knowledge Boundary of Black-Box LLM
Haotian Sheng, Heyong Wang, Ming Hong, Hongman He, Junqiu Liu

TL;DR
This paper introduces LSCL, a deep learning method that enables black-box LLMs to express their knowledge boundaries, improving understanding of their knowledge limits and reducing hallucinations.
Contribution
The paper presents a novel deep learning-based approach for knowledge boundary expression specifically tailored for black-box LLMs, which was previously underexplored.
Findings
LSCL outperforms baseline models in accuracy and recall.
The method effectively maps inputs to internal knowledge states.
An adaptive alternative performs nearly as well without token probability access.
Abstract
Large Language Models (LLMs) have achieved remarkable success, however, the emergence of content generation distortion (hallucination) limits their practical applications. The core cause of hallucination lies in LLMs' lack of awareness regarding their stored internal knowledge, preventing them from expressing their knowledge state on questions beyond their internal knowledge boundaries, as humans do. However, existing research on knowledge boundary expression primarily focuses on white-box LLMs, leaving methods suitable for black-box LLMs which offer only API access without revealing internal parameters-largely unexplored. Against this backdrop, this paper proposes LSCL (LLM-Supervised Confidence Learning), a deep learning-based method for expressing the knowledge boundaries of black-box LLMs. Based on the knowledge distillation framework, this method designs a deep learning model.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Misinformation and Its Impacts
