From Implicit to Explicit: Enhancing Self-Recognition in Large Language Models
Yinghan Zhou, Weifeng Zhu, Juan Wen, Wanli Peng, Zhengxian Wu, Yiming Xue

TL;DR
This paper investigates why large language models struggle with self-recognition in individual text scenarios and introduces a novel framework, CoSur, that significantly improves their ability to identify self-generated texts.
Contribution
The paper identifies implicit self-recognition as the cause of failure and proposes CoSur, a new framework with four modules, to enhance self-recognition in LLMs under the IPP scenario.
Findings
Achieved over 97% accuracy in self-recognition across three LLMs.
Identified implicit self-recognition as the key failure point.
Demonstrated effectiveness of CoSur in improving self-recognition performance.
Abstract
Large language models (LLMs) have been shown to possess a degree of self-recognition ability, which used to identify whether a given text was generated by themselves. Prior work has demonstrated that this capability is reliably expressed under the pair presentation paradigm (PPP), where the model is presented with two texts and asked to choose which one it authored. However, performance deteriorates sharply under the individual presentation paradigm (IPP), where the model is given a single text to judge authorship. Although this phenomenon has been observed, its underlying causes have not been systematically analyzed. In this paper, we first investigate the cause of this failure and attribute it to implicit self-recognition (ISR). ISR describes the gap between internal representations and output behavior in LLMs: under the IPP scenario, the model encodes self-recognition information in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDental Education, Practice, Research
