ProvG-Searcher: A Graph Representation Learning Approach for Efficient Provenance Graph Search
Enes Altinisik, Fatih Deniz, Husrev Taha Sencar

TL;DR
ProvG-Searcher introduces a graph embedding-based method for efficient and accurate detection of known APT behaviors in system logs by transforming provenance graphs into vector representations for quick subgraph matching.
Contribution
It presents a novel graph representation learning approach that simplifies provenance graph search through order embeddings and graph partitioning, improving efficiency and accuracy.
Findings
Achieves over 99% accuracy in behavior detection
Maintains a false positive rate around 0.02%
Outperforms existing methods in experimental evaluations
Abstract
We present ProvG-Searcher, a novel approach for detecting known APT behaviors within system security logs. Our approach leverages provenance graphs, a comprehensive graph representation of event logs, to capture and depict data provenance relations by mapping system entities as nodes and their interactions as edges. We formulate the task of searching provenance graphs as a subgraph matching problem and employ a graph representation learning method. The central component of our search methodology involves embedding of subgraphs in a vector space where subgraph relationships can be directly evaluated. We achieve this through the use of order embeddings that simplify subgraph matching to straightforward comparisons between a query and precomputed subgraph representations. To address challenges posed by the size and complexity of provenance graphs, we propose a graph partitioning scheme and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Software System Performance and Reliability · Advanced Graph Neural Networks
