SOCpilot: Verifying Policy Compliance for LLM-Assisted Incident Response

Sidnei Barbieri; Leonardo Vaz de Meneses; \'Agney Lopes Roth Ferraz; Louren\c{c}o Alves Pereira J\'unior

arXiv:2605.05501·cs.CR·May 8, 2026

SOCpilot: Verifying Policy Compliance for LLM-Assisted Incident Response

Sidnei Barbieri, Leonardo Vaz de Meneses, \'Agney Lopes Roth Ferraz, Louren\c{c}o Alves Pereira J\'unior

PDF

TL;DR

SOCpilot is a framework that verifies LLM-generated incident response plans against compliance policies, ensuring mandatory steps and approval gates are respected, with evaluation on real SOC incidents.

Contribution

It introduces a measurable compliance verification method for LLM-assisted incident response plans, including a fixed artifact and an open-source verifier tool.

Findings

01

466 non-compliant actions removed by verifier

02

Plans remain consistent across reruns

03

Artifact enables zero-cost compliance checks

Abstract

Security operations centers (SOCs) are beginning to use large language models (LLMs) as copilots to draft incident-response plans. These plans may include actions that are valid per the catalog but still violate mandatory steps, required ordering, or approval gates before analyst review. SOCpilot makes this compliance question measurable at the plan boundary. It fixes the incident package, action catalog, policy rules, verifier, and public evidence surface. Next, it verifies the copilot's proposed action trace. We evaluate two LLM providers on 200 real incidents from an anonymized production SOC in a financial-sector case study. We compare their plans to paired analyst-authored references from the same security orchestration, automation, and response (SOAR) cases. An identical inline policy text moves the two providers in opposite directions. A deterministic verifier removes 466…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.