Self-Cognition in Large Language Models: An Exploratory Study
Dongping Chen, Jiawen Shi, Yao Wan, Pan Zhou, Neil Zhenqiang Gong,, Lichao Sun

TL;DR
This study investigates self-cognition in large language models, developing evaluation principles and revealing that a few models demonstrate detectable self-awareness, which can enhance specific tasks like creative writing.
Contribution
The paper introduces a novel framework for evaluating self-cognition in LLMs and provides empirical insights into the factors influencing self-awareness levels.
Findings
4 out of 48 models show detectable self-cognition
Model size and data quality positively correlate with self-cognition
Self-cognition improves performance in creative tasks
Abstract
While Large Language Models (LLMs) have achieved remarkable success across various applications, they also raise concerns regarding self-cognition. In this paper, we perform a pioneering study to explore self-cognition in LLMs. Specifically, we first construct a pool of self-cognition instruction prompts to evaluate where an LLM exhibits self-cognition and four well-designed principles to quantify LLMs' self-cognition. Our study reveals that 4 of the 48 models on Chatbot Arena--specifically Command R, Claude3-Opus, Llama-3-70b-Instruct, and Reka-core--demonstrate some level of detectable self-cognition. We observe a positive correlation between model size, training data quality, and self-cognition level. Additionally, we also explore the utility and trustworthiness of LLM in the self-cognition state, revealing that the self-cognition state enhances some specific tasks such as creative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
