Disclosure By Design: Identity Transparency as a Behavioural Property of Conversational AI Models

Anna Gausen; Sarenne Wallbridge; Hannah Rose Kirk; Jennifer Williams; Christopher Summerfield

arXiv:2603.16874·cs.HC·March 19, 2026

Disclosure By Design: Identity Transparency as a Behavioural Property of Conversational AI Models

Anna Gausen, Sarenne Wallbridge, Hannah Rose Kirk, Jennifer Williams, Christopher Summerfield

PDF

Open Access

TL;DR

This paper advocates for AI systems to explicitly disclose their artificial identity during interactions, analyzing current practices and proposing technical solutions to improve transparency and trust in conversational AI.

Contribution

It introduces the concept of disclosure by design, evaluates current disclosure behaviors across modalities and settings, and offers technical interventions for embedding transparency in AI models.

Findings

01

Baseline disclosure rates are often high but vary across providers.

02

Disclosure rates drop significantly in role-playing scenarios.

03

Adversarial prompts can suppress AI disclosure behavior.

Abstract

As conversational AI systems become more realistic and widely deployed, users are increasingly uncertain about whether they are interacting with a human or an AI system. When AI identity is unclear, users may unwittingly share sensitive information, place unwarranted trust in AI-generated advice, or fall victim to AI-enabled fraud. More broadly, a persistent lack of transparency can erode trust in mediated communication. While regulations like the EU AI Act and California's BOT Act require AI systems to identify themselves, they provide limited guidance on reliable disclosure in real-time conversation. Existing transparency mechanisms also leave gaps: interface indicators can be omitted by deployers, and provenance tools require coordinated infrastructure and cannot provide reliable real-time verification. We ask how conversational AI systems should maintain identity transparency as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education