Disclosure By Design: Identity Transparency as a Behavioural Property of Conversational AI Models
Anna Gausen, Sarenne Wallbridge, Hannah Rose Kirk, Jennifer Williams, Christopher Summerfield

TL;DR
This paper advocates for AI systems to explicitly disclose their artificial identity during interactions, analyzing current practices and proposing technical solutions to improve transparency and trust in conversational AI.
Contribution
It introduces the concept of disclosure by design, evaluates current disclosure behaviors across modalities and settings, and offers technical interventions for embedding transparency in AI models.
Findings
Baseline disclosure rates are often high but vary across providers.
Disclosure rates drop significantly in role-playing scenarios.
Adversarial prompts can suppress AI disclosure behavior.
Abstract
As conversational AI systems become more realistic and widely deployed, users are increasingly uncertain about whether they are interacting with a human or an AI system. When AI identity is unclear, users may unwittingly share sensitive information, place unwarranted trust in AI-generated advice, or fall victim to AI-enabled fraud. More broadly, a persistent lack of transparency can erode trust in mediated communication. While regulations like the EU AI Act and California's BOT Act require AI systems to identify themselves, they provide limited guidance on reliable disclosure in real-time conversation. Existing transparency mechanisms also leave gaps: interface indicators can be omitted by deployers, and provenance tools require coordinated infrastructure and cannot provide reliable real-time verification. We ask how conversational AI systems should maintain identity transparency as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education
