"What Did It Actually Do?": Understanding Risk Awareness and Traceability for Computer-Use Agents

Zifan Peng; Mingchen Li

arXiv:2603.28551·cs.CR·April 1, 2026

"What Did It Actually Do?": Understanding Risk Awareness and Traceability for Computer-Use Agents

Zifan Peng, Mingchen Li

PDF

TL;DR

This paper explores risk awareness and traceability in personalized computer-use agents, proposing a framework to improve user understanding of agent actions, permissions, and residual effects for safer deployment.

Contribution

It introduces AgentTrace, a traceability framework with a prototype interface that enhances understanding and auditability of agent behaviors and their impacts.

Findings

01

Participants recognized agents as risky but lacked concrete mental models.

02

Traceability interfaces improved understanding of agent actions.

03

Enhanced traceability supports anomaly detection and calibrated trust.

Abstract

Personalized computer-use agents are rapidly moving from expert communities into mainstream use. Unlike conventional chatbots, these systems can install skills, invoke tools, access private resources, and modify local environments on users' behalf. Yet users often do not know what authority they have delegated, what the agent actually did during task execution, or whether the system has been safely removed afterward. We investigate this gap as a combined problem of risk understanding and post-hoc auditability, using OpenClaw as a motivating case. We first build a multi-source corpus of the OpenClaw ecosystem, including incidents, advisories, malicious-skill reports, news coverage, tutorials, and social-media narratives. We then conduct an interview study to examine how users and practitioners understand skills, autonomy, privilege, persistence, and uninstallation. Our findings suggest…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.