IRCopilot: Automated Incident Response with Large Language Models

Xihuan Lin; Jie Zhang; Gelei Deng; Tianzhe Liu; Tianwei Zhang; Qing Guo; Riqing Chen

arXiv:2505.20945·cs.CR·October 31, 2025

IRCopilot: Automated Incident Response with Large Language Models

Xihuan Lin, Jie Zhang, Gelei Deng, Tianzhe Liu, Tianwei Zhang, Qing Guo, Riqing Chen

PDF

Open Access

TL;DR

This paper introduces IRCopilot, a framework using large language models to automate incident response, addressing key challenges like hallucinations and context loss, and demonstrating superior performance over baseline models in real-world scenarios.

Contribution

We develop IRCopilot, a novel LLM-based incident response framework with modular components and strategic prompts, improving automation and effectiveness in complex cyber attack scenarios.

Findings

01

Outperforms baseline LLMs with over 114% sub-task completion rates

02

Demonstrates robustness on public incident response platforms

03

Effectively handles real-world attack scenarios

Abstract

Incident response plays a pivotal role in mitigating the impact of cyber attacks. In recent years, the intensity and complexity of global cyber threats have grown significantly, making it increasingly challenging for traditional threat detection and incident response methods to operate effectively in complex network environments. While Large Language Models (LLMs) have shown great potential in early threat detection, their capabilities remain limited when it comes to automated incident response after an intrusion. To address this gap, we construct an incremental benchmark based on real-world incident response tasks to thoroughly evaluate the performance of LLMs in this domain. Our analysis reveals several key challenges that hinder the practical application of contemporary LLMs, including context loss, hallucinations, privacy protection concerns, and their limited ability to provide…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware System Performance and Reliability · Topic Modeling · Anomaly Detection Techniques and Applications