MalPhase: Fine-Grained Malware Detection Using Network Flow Data

Michal Piskozub; Fabio De Gaspari; Frederick Barr-Smith; Luigi V.; Mancini; Ivan Martinovic

arXiv:2106.00541·cs.CR·June 2, 2021

MalPhase: Fine-Grained Malware Detection Using Network Flow Data

Michal Piskozub, Fabio De Gaspari, Frederick Barr-Smith, Luigi V., Mancini, Ivan Martinovic

PDF

TL;DR

MalPhase is a multi-phase deep learning system that effectively detects and classifies malware from network flow data, even in noisy, real-world traffic conditions, achieving high accuracy and robustness.

Contribution

MalPhase introduces a novel multi-tier architecture with extended features and denoising autoencoders for improved malware detection and classification from aggregated network flows.

Findings

01

Detects malicious flows with >98% F1 score

02

Classifies malware type with >93% F1 score

03

Performs well on mixed benign and malicious traffic

Abstract

Economic incentives encourage malware authors to constantly develop new, increasingly complex malware to steal sensitive data or blackmail individuals and companies into paying large ransoms. In 2017, the worldwide economic impact of cyberattacks is estimated to be between 445 and 600 billion USD, or 0.8% of global GDP. Traditionally, one of the approaches used to defend against malware is network traffic analysis, which relies on network data to detect the presence of potentially malicious software. However, to keep up with increasing network speeds and amount of traffic, network analysis is generally limited to work on aggregated network data, which is traditionally challenging and yields mixed results. In this paper we present MalPhase, a system that was designed to cope with the limitations of aggregated flows. MalPhase features a multi-phase pipeline for malware detection, type and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.