Visibility into AI Agents
Alan Chan, Carson Ezell, Max Kaufmann, Kevin Wei, Lewis Hammond,, Herbie Bradley, Emma Bluemke, Nitarshan Rajkumar, David Krueger, Noam Kolt,, Lennart Heim, Markus Anderljung

TL;DR
This paper explores methods to improve transparency and accountability of AI agents through identifiers, monitoring, and logging, analyzing their implementation, privacy implications, and impact on power dynamics.
Contribution
It provides a comprehensive assessment of visibility measures for AI agents, detailing their implementation, applicability, and societal implications.
Findings
Visibility measures vary in intrusiveness and informativeness.
Application of measures depends on deployment context and actors involved.
Enhanced visibility can support governance but raises privacy and power concerns.
Abstract
Increased delegation of commercial, scientific, governmental, and personal activities to AI agents -- systems capable of pursuing complex goals with limited supervision -- may exacerbate existing societal risks and introduce new risks. Understanding and mitigating these risks involves critically evaluating existing governance structures, revising and adapting these structures where needed, and ensuring accountability of key stakeholders. Information about where, why, how, and by whom certain AI agents are used, which we refer to as visibility, is critical to these objectives. In this paper, we assess three categories of measures to increase visibility into AI agents: agent identifiers, real-time monitoring, and activity logging. For each, we outline potential implementations that vary in intrusiveness and informativeness. We analyze how the measures apply across a spectrum of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlockchain Technology Applications and Security · Ethics and Social Impacts of AI · Cybercrime and Law Enforcement Studies
Methodstravel james
