Understanding Human-AI Collaboration in Cybersecurity Competitions

Tingxuan Tang; Nicolas Janis; Kalyn Asher Montague; Kevin Eykholt; Dhilung Kirat; Youngja Park; Jiyong Jang; Adwait Nadkarni; Yue Xiao

arXiv:2602.20446·cs.CR·February 25, 2026

Open Access

TL;DR

This study examines how human players collaborate with AI in live, real-world Capture-the-Flag (CTF) contests. It reports on perception, trust, and performance, and benchmarks autonomous AI agents against human teams.

Contribution

The paper provides the first empirical analysis of human-AI collaboration in live CTF competitions, covering player perception, collaboration dynamics, and performance benchmarking of autonomous AI agents.

Findings

01. Teams delegate more tasks to AI over time

02. Human prompting limits AI effectiveness

03. Autonomous AI agents outperform most human teams

Abstract

Capture-the-Flag (CTF) competitions are increasingly used as a testbed for evaluating how well AI solves security tasks, because they offer controlled environments and objective success criteria. Existing evaluations have focused on how successful AI is at solving CTF challenges in isolation from human CTF players. As AI usage increases in both academic and industrial settings, it is equally likely that human players will collaborate with AI agents to solve challenges. This possibility exposes a key knowledge gap: how do humans perceive AI assistance in CTFs; when assistance is provided, how do they collaborate with it, and does it improve human performance; and how do AI-assisted humans compare with fully autonomous AI agents on the same challenges? We address this gap with the first empirical study of AI assistance in a live, onsite CTF. In a study with 41…

Taxonomy

Topics: Human-Automation Interaction and Safety · Ethics and Social Impacts of AI · Adversarial Robustness in Machine Learning