LESS: Efficient Log Storage System Based on Learned Model and Minimum Attribute Tree
Zhiyang Cheng, Zizhen Zhu, Haoran Dang, Hai Wan, Xibin Zhao

TL;DR
LESS is a novel provenance graph storage system that significantly reduces storage space and query time by partitioning graph components and employing machine learning and minimal spanning trees, enhancing long-term cyber attack investigations.
Contribution
The paper introduces a new provenance graph storage system, LESS, which separates structure and attributes, applying machine learning and minimal spanning trees for efficient, lossless storage and fast querying.
Findings
Reduces storage time by 6.29 times compared to LEONARD.
Achieves 5.24 times less disk usage.
Enables 18.3 times faster query speed with minimal memory usage.
Abstract
In recent years, cyber attacks have become increasingly sophisticated and persistent. Detection and investigation based on the provenance graph can effectively mitigate cyber intrusion. However, in the long time span of defenses, the sheer size of the provenance graph will pose significant challenges to the storage systems. Faced with long-term storage tasks, existing methods are unable to simultaneously achieve lossless information, efficient compression, and fast query support. In this paper, we propose a novel provenance graph storage system, LESS, which consumes smaller storage space and supports faster storage and queries compared to current approaches. We innovatively partition the provenance graph into two distinct components, the graph structure and attribute, and store them separately. Based on their respective characteristics, we devise two appropriate storage schemes: the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Power Systems and Technologies · Advanced Algorithms and Applications
