Machine Learning for Encrypted Malicious Traffic Detection: Approaches,   Datasets and Comparative Study

Zihao Wang; Kar-Wai Fok; Vrizlynn L. L. Thing

arXiv:2203.09332·cs.CR·November 21, 2022

Machine Learning for Encrypted Malicious Traffic Detection: Approaches, Datasets and Comparative Study

Zihao Wang, Kar-Wai Fok, Vrizlynn L. L. Thing

PDF

TL;DR

This paper reviews machine learning methods for detecting malicious encrypted traffic, creates a comprehensive dataset from multiple sources, and compares ten detection algorithms to advance research in this critical cybersecurity area.

Contribution

It formulates a universal framework for encrypted malicious traffic detection, combines datasets for fair comparison, and evaluates multiple algorithms systematically.

Findings

01

10 detection algorithms compared on a unified dataset

02

Identified challenges in dataset standardization and model performance

03

Proposed future research directions in encrypted traffic detection

Abstract

As people's demand for personal privacy and data security becomes a priority, encrypted traffic has become mainstream in the cyber world. However, traffic encryption is also shielding malicious and illegal traffic introduced by adversaries, from being detected. This is especially so in the post-COVID-19 environment where malicious traffic encryption is growing rapidly. Common security solutions that rely on plain payload content analysis such as deep packet inspection are rendered useless. Thus, machine learning based approaches have become an important direction for encrypted malicious traffic detection. In this paper, we formulate a universal framework of machine learning based encrypted malicious traffic detection techniques and provided a systematic review. Furthermore, current research adopts different datasets to train their models due to the lack of well-recognized datasets and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.