TL;DR
This paper presents a novel similarity measure called covering similarity, optimized for sequence comparison, and demonstrates its effectiveness in host-based intrusion detection by isolating attack sequences from normal ones.
Contribution
The paper introduces the covering similarity measure and an efficient suffix tree-based implementation, applying it to intrusion detection and outperforming existing methods on benchmark datasets.
Findings
Covering similarity effectively detects anomalies in system call sequences.
The method outperforms state-of-the-art techniques on benchmark datasets.
Efficient implementation enables handling large-scale sequence analysis.
Abstract
This paper introduces a new similarity measure, the covering similarity, that we formally define for evaluating the similarity between a symbolic sequence and a set of symbolic sequences. A pair-wise similarity can also be directly derived from the covering similarity to compare two symbolic sequences. An efficient implementation to compute the covering similarity is proposed that uses a suffix tree data-structure, but other implementations, based on suffix array for instance, are possible and possibly necessary for handling large scale problems. We have used this similarity to isolate attack sequences from normal sequences in the scope of Host-based Intrusion Detection. We have assessed the covering similarity on two well-known benchmarks in the field. In view of the results reported on these two datasets for the state of the art methods, and according to the comparative study we have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
