The Narrative Continuity Test: A Conceptual Framework for Evaluating Identity Persistence in AI Systems

Stefano Natangelo

arXiv:2510.24831·cs.CY·November 4, 2025

The Narrative Continuity Test: A Conceptual Framework for Evaluating Identity Persistence in AI Systems

Stefano Natangelo

PDF

TL;DR

The paper proposes the Narrative Continuity Test (NCT), a framework for assessing whether AI systems maintain identity and coherence over time, highlighting current limitations and guiding future architectural improvements.

Contribution

It introduces the NCT as a novel conceptual framework for evaluating identity persistence in AI, shifting focus from task performance to diachronic coherence.

Findings

01

Current architectures fail to support persistence axes

02

Case studies reveal predictable continuity failures

03

NCT guides future AI design for long-term coherence

Abstract

Artificial intelligence systems based on large language models (LLMs) can now generate coherent text, music, and images, yet they operate without a persistent state: each inference reconstructs context from scratch. This paper introduces the Narrative Continuity Test (NCT) -- a conceptual framework for evaluating identity persistence and diachronic coherence in AI systems. Unlike capability benchmarks that assess task performance, the NCT examines whether an LLM remains the same interlocutor across time and interaction gaps. The framework defines five necessary axes -- Situated Memory, Goal Persistence, Autonomous Self-Correction, Stylistic & Semantic Stability, and Persona/Role Continuity -- and explains why current architectures systematically fail to support them. Case analyses (Character.\,AI, Grok, Replit, Air Canada) show predictable continuity failures under stateless inference.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.