Automated Ransomware Behavior Analysis: Pattern Extraction and Early Detection
Qian Chen, Sheikh Rabiul Islam, Henry Haswell, Robert A. Bridges

TL;DR
This paper introduces an automated tool that uses machine learning to analyze malware behavior, extract patterns, and enable early detection, reducing manual effort in SOC investigations.
Contribution
The paper presents a novel automated malware pattern extraction and detection system utilizing three machine learning methods, validated on multiple ransomware samples.
Findings
TF-IDF outperforms other methods in feature discrimination
Extra Trees (ET) is the most time-efficient and robust approach
The tool effectively identifies malware behavior patterns and aids forensic analysis
Abstract
Security operation centers (SOCs) typically use a variety of tools to collect large volumes of host logs for detection and forensic of intrusions. Our experience, supported by recent user studies on SOC operators, indicates that operators spend ample time (e.g., hundreds of man-hours) on investigations into logs seeking adversarial actions. Similarly, reconfiguration of tools to adapt detectors for future similar attacks is commonplace upon gaining novel insights (e.g., through internal investigation or shared indicators). This paper presents an automated malware pattern-extraction and early detection tool, testing three machine learning approaches: TF-IDF (term frequency-inverse document frequency), Fisher's LDA (linear discriminant analysis) and ET (extra trees/extremely randomized trees) that can (1) analyze freshly discovered malware samples in sandboxes and generate dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
