IRCopilot: Automated Incident Response with Large Language Models
Xihuan Lin, Jie Zhang, Gelei Deng, Tianzhe Liu, Tianwei Zhang, Qing Guo, Riqing Chen

TL;DR
This paper introduces IRCopilot, a framework using large language models to automate incident response, addressing key challenges like hallucinations and context loss, and demonstrating superior performance over baseline models in real-world scenarios.
Contribution
We develop IRCopilot, a novel LLM-based incident response framework with modular components and strategic prompts, improving automation and effectiveness in complex cyber attack scenarios.
Findings
Outperforms baseline LLMs with over 114% sub-task completion rates
Demonstrates robustness on public incident response platforms
Effectively handles real-world attack scenarios
Abstract
Incident response plays a pivotal role in mitigating the impact of cyber attacks. In recent years, the intensity and complexity of global cyber threats have grown significantly, making it increasingly challenging for traditional threat detection and incident response methods to operate effectively in complex network environments. While Large Language Models (LLMs) have shown great potential in early threat detection, their capabilities remain limited when it comes to automated incident response after an intrusion. To address this gap, we construct an incremental benchmark based on real-world incident response tasks to thoroughly evaluate the performance of LLMs in this domain. Our analysis reveals several key challenges that hinder the practical application of contemporary LLMs, including context loss, hallucinations, privacy protection concerns, and their limited ability to provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Topic Modeling · Anomaly Detection Techniques and Applications
