Survey on Factuality in Large Language Models: Knowledge, Retrieval and   Domain-Specificity

Cunxiang Wang; Xiaoze Liu; Yuanhao Yue; Xiangru Tang; Tianhang Zhang,; Cheng Jiayang; Yunzhi Yao; Wenyang Gao; Xuming Hu; Zehan Qi; Yidong Wang,; Linyi Yang; Jindong Wang; Xing Xie; Zheng Zhang; Yue Zhang

arXiv:2310.07521·cs.CL·December 19, 2023·52 cites

Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity

Cunxiang Wang, Xiaoze Liu, Yuanhao Yue, Xiangru Tang, Tianhang Zhang,, Cheng Jiayang, Yunzhi Yao, Wenyang Gao, Xuming Hu, Zehan Qi, Yidong Wang,, Linyi Yang, Jindong Wang, Xing Xie, Zheng Zhang, Yue Zhang

PDF

Open Access 1 Repo

TL;DR

This survey reviews the challenges, evaluation methods, and strategies for improving factual accuracy in Large Language Models across various domains, emphasizing both standalone and retrieval-augmented configurations.

Contribution

It provides a comprehensive overview of factuality issues in LLMs, analyzing mechanisms, evaluation metrics, and domain-specific enhancement strategies.

Findings

01

Identification of key challenges in LLM factuality

02

Analysis of evaluation benchmarks and metrics

03

Discussion of domain-specific factuality improvement approaches

Abstract

This survey addresses the crucial issue of factuality in Large Language Models (LLMs). As LLMs find applications across diverse domains, the reliability and accuracy of their outputs become vital. We define the Factuality Issue as the probability of LLMs to produce content inconsistent with established facts. We first delve into the implications of these inaccuracies, highlighting the potential consequences and challenges posed by factual errors in LLM outputs. Subsequently, we analyze the mechanisms through which LLMs store and process facts, seeking the primary causes of factual errors. Our discussion then transitions to methodologies for evaluating LLM factuality, emphasizing key metrics, benchmarks, and studies. We further explore strategies for enhancing LLM factuality, including approaches tailored for specific domains. We focus two primary LLM configurations standalone LLMs and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wangcunxiang/llm-factuality-survey
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Computational and Text Analysis Methods

MethodsFocus