Understanding Human-AI Collaboration in Cybersecurity Competitions
Tingxuan Tang, Nicolas Janis, Kalyn Asher Montague, Kevin Eykholt, Dhilung Kirat, Youngja Park, Jiyong Jang, Adwait Nadkarni, Yue Xiao

TL;DR
This study explores how humans collaborate with AI in cybersecurity competitions, revealing insights into perception, trust, and performance, and benchmarking AI agents against human teams in real-world Capture-the-Flag contests.
Contribution
It provides the first empirical analysis of human-AI collaboration in live CTF competitions, including perception, collaboration dynamics, and performance benchmarking of autonomous AI agents.
Findings
Teams delegate more tasks to AI over time
Human prompting limits AI effectiveness
Autonomous AI agents outperform most human teams
Abstract
Capture-the-Flag (CTF) competitions are increasingly becoming a testbed for evaluating AI capabilities at solving security tasks, due to the controlled environments and objective success criteria. Existing evaluations have focused on how successful AI is at solving CTF challenges in isolation from human CTF players. As AI usage increases in both academic and industrial settings, it is equally likely that human players may collaborate with AI agents to solve challenges. This possibility exposes a key knowledge gap: how do humans perceive AI CTF assistance; when assistance is provided, how do they collaborate and is it effective with respect to human performance; how do humans assisted by AI compare to the performance of fully autonomous AI agents on the same challenges. We address this gap with the first empirical study of AI assistance in a live, onsite CTF. In a study with 41…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman-Automation Interaction and Safety · Ethics and Social Impacts of AI · Adversarial Robustness in Machine Learning
