MalPhase: Fine-Grained Malware Detection Using Network Flow Data
Michal Piskozub, Fabio De Gaspari, Frederick Barr-Smith, Luigi V., Mancini, Ivan Martinovic

TL;DR
MalPhase is a multi-phase deep learning system that effectively detects and classifies malware from network flow data, even in noisy, real-world traffic conditions, achieving high accuracy and robustness.
Contribution
MalPhase introduces a novel multi-tier architecture with extended features and denoising autoencoders for improved malware detection and classification from aggregated network flows.
Findings
Detects malicious flows with >98% F1 score
Classifies malware type with >93% F1 score
Performs well on mixed benign and malicious traffic
Abstract
Economic incentives encourage malware authors to constantly develop new, increasingly complex malware to steal sensitive data or blackmail individuals and companies into paying large ransoms. In 2017, the worldwide economic impact of cyberattacks is estimated to be between 445 and 600 billion USD, or 0.8% of global GDP. Traditionally, one of the approaches used to defend against malware is network traffic analysis, which relies on network data to detect the presence of potentially malicious software. However, to keep up with increasing network speeds and amount of traffic, network analysis is generally limited to work on aggregated network data, which is traditionally challenging and yields mixed results. In this paper we present MalPhase, a system that was designed to cope with the limitations of aggregated flows. MalPhase features a multi-phase pipeline for malware detection, type and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
