Don't Let AI Agents YOLO Your Files: Shifting Information and Control to Filesystems for Agent Safety and Autonomy
Shawn Wanxiang Zhong, Junxuan Liao, Jing Liu, Mai Zheng, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau

TL;DR
This paper introduces YoloFS, a filesystem-integrated approach that enhances AI agent safety and autonomy by providing better control and transparency over filesystem effects.
Contribution
It proposes shifting filesystem information and control to the filesystem itself, enabling safer and more autonomous AI agents through novel techniques like staging, snapshots, and progressive permission.
Findings
YoloFS enables agent self-correction in 8 out of 11 tasks with hidden side effects.
YoloFS reduces user interactions on routine tasks while maintaining success rates.
The methodology effectively captures user-agent-filesystem interactions.
Abstract
AI coding agents operate directly on users' filesystems, where they regularly corrupt data, delete files, and leak secrets. Current approaches force a tradeoff between safety and autonomy: unrestricted access risks harm, while frequent permission prompts burden users and block agents. To understand this problem, we conduct the first systematic study of agent filesystem misuse, analyzing 290 public reports across 13 frameworks. Our analysis reveals that today's agents have limited information about their filesystem effects and insufficient control over them. We therefore argue for shifting this information and control to the filesystem itself. Based on this principle, we design YoloFS, an agent-native filesystem with three techniques. Staging isolates all mutations before commit, giving users corrective control. Snapshots extend this control to agents, letting them detect and correct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
