LaMsS: When Large Language Models Meet Self-Skepticism
Yetao Wu, Yihong Wang, Teng Chen, Ningyuan Xi, Qingqing Gu, Hongyang, Lei, Luo Ji

TL;DR
LaMsS introduces a self-skepticism mechanism for large language models, enhancing their ability to self-assess and reduce hallucinations, leading to improved accuracy and robustness across multiple tasks.
Contribution
This paper presents a novel self-skepticism approach for LLMs, integrating skepticism tokens to enable self-awareness and improve factual accuracy.
Findings
Outperforms baselines on multiple-choice and open-domain QA
Generalizes well to multi-task and out-of-domain settings
Improves accuracy, AUC, and AP in self-aware answering
Abstract
Hallucination is a major challenge for large language models (LLMs), preventing their further application in some fields. The skeptical thinking of humankind could be useful for LLMs to self-cognition, self-reflection and alleviate their hallucinations. Inspired by this consideration, we propose a novel approach called LaMsS, which combines the semantic understanding capability of LLMs with self-skepticism. By introducing a series of skepticism tokens and augmenting them into the vocabulary, we conduct both pertaining and finetuning, which allow the LLM to decode each normal token followed by a skeptical token, representing different skepticism levels. By calculating the response skepticism given a query, one can define a new self-aware LLM which is only willing to answer with relative lower skepticism level than the threshold. By examining the accuracy, AUC and AP of willingly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEpilepsy research and treatment · Machine Learning in Healthcare
MethodsAttentive Walk-Aggregating Graph Neural Network
