Synthetic flow-based cryptomining attack generation through Generative   Adversarial Networks

Alberto Mozo; \'Angel Gonz\'alez-Prieto; Antonio Pastor; Sandra; G\'omez-Canaval; Edgar Talavera

arXiv:2107.14776·cs.CR·February 9, 2022

Synthetic flow-based cryptomining attack generation through Generative Adversarial Networks

Alberto Mozo, \'Angel Gonz\'alez-Prieto, Antonio Pastor, Sandra, G\'omez-Canaval, Edgar Talavera

PDF

TL;DR

This paper introduces a novel deterministic method to evaluate and select high-quality synthetic network traffic generated by GANs, enabling privacy-preserving training of intrusion detection systems that can fully replace real data.

Contribution

It proposes a new deterministic quality measure for GAN-generated data and heuristics for selecting optimal generators, improving synthetic data utility and privacy in network intrusion detection.

Findings

01

Synthetic traffic can fully replace real data in ML training for cryptomining detection.

02

The proposed method achieves comparable detection performance without privacy breaches.

03

Heuristics effectively select the best GAN models during training.

Abstract

Due to the growing rise of cyber attacks in the Internet, flow-based data sets are crucial to increase the performance of the Machine Learning (ML) components that run in network-based intrusion detection systems (IDS). To overcome the existing network traffic data shortage in attack analysis, recent works propose Generative Adversarial Networks (GANs) for synthetic flow-based network traffic generation. Data privacy is appearing more and more as a strong requirement when processing such network data, which suggests to find solutions where synthetic data can fully replace real data. Because of the ill-convergence of the GAN training, none of the existing solutions can generate high-quality fully synthetic data that can totally substitute real data in the training of IDS ML components. Therefore, they mix real with synthetic data, which acts only as data augmentation components, leading…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.