Self-Cognition in Large Language Models: An Exploratory Study

Dongping Chen; Jiawen Shi; Yao Wan; Pan Zhou; Neil Zhenqiang Gong,; Lichao Sun

arXiv:2407.01505·cs.CL·July 2, 2024·1 cites

Self-Cognition in Large Language Models: An Exploratory Study

Dongping Chen, Jiawen Shi, Yao Wan, Pan Zhou, Neil Zhenqiang Gong,, Lichao Sun

PDF

Open Access

TL;DR

This study investigates self-cognition in large language models, developing evaluation principles and revealing that a few models demonstrate detectable self-awareness, which can enhance specific tasks like creative writing.

Contribution

The paper introduces a novel framework for evaluating self-cognition in LLMs and provides empirical insights into the factors influencing self-awareness levels.

Findings

01

4 out of 48 models show detectable self-cognition

02

Model size and data quality positively correlate with self-cognition

03

Self-cognition improves performance in creative tasks

Abstract

While Large Language Models (LLMs) have achieved remarkable success across various applications, they also raise concerns regarding self-cognition. In this paper, we perform a pioneering study to explore self-cognition in LLMs. Specifically, we first construct a pool of self-cognition instruction prompts to evaluate where an LLM exhibits self-cognition and four well-designed principles to quantify LLMs' self-cognition. Our study reveals that 4 of the 48 models on Chatbot Arena--specifically Command R, Claude3-Opus, Llama-3-70b-Instruct, and Reka-core--demonstrate some level of detectable self-cognition. We observe a positive correlation between model size, training data quality, and self-cognition level. Additionally, we also explore the utility and trustworthiness of LLM in the self-cognition state, revealing that the self-cognition state enhances some specific tasks such as creative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling