Evolution of Log-Based Detection Rules in Public Repositories
Minjun Long, David Evans

TL;DR
This study longitudinally analyzes how log-based detection rules evolve in public repositories, revealing non-monotonic changes and operational trade-offs rather than convergence.
Contribution
It introduces a predicate graph representation and alignment method to compare rule logic over time, providing new insights into detection rule evolution.
Findings
56% of rules undergo at least one logical revision
Over half of rules add and remove clauses over time
Approximately 25-33% of rules alternate between expanding and reducing coverage
Abstract
Log-based detection rules remain central to modern security operations, encoding domain expertise that analysts iteratively refine to balance detection coverage against alert volume. Yet while prior work has examined the evolution of network intrusion detection signatures, the longitudinal behavior of log-based detection rules has received little empirical study. We present the first longitudinal analysis of detection rule evolution across two widely used repositories: the community-driven Sigma project and the curated Splunk Security Content (SSC). To compare rule versions based on detection logic rather than surface syntax, we introduce a predicate graph intermediate representation that canonicalizes the logical structure of a rule, together with a tree alignment procedure for analyzing changes across revisions. We apply this method to 6,859 rule histories from Sigma and SSC and find…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
