MAWIFlow Benchmark: Realistic Flow-Based Evaluation for Network Intrusion Detection
Joshua Schraven, Alexander Windmann, Oliver Niggemann

TL;DR
This paper introduces MAWIFlow, a realistic flow-based benchmark derived from real traffic data, to improve the evaluation of network intrusion detection methods by reflecting true operational variability and temporal drift.
Contribution
The paper presents MAWIFlow, a new realistic dataset with a reproducible preprocessing pipeline and baseline evaluations, highlighting the limitations of synthetic benchmarks and static models.
Findings
Tree-based classifiers degrade over time on static data
CNN-BiLSTM maintains better performance over time
Realistic datasets improve evaluation of intrusion detection methods
Abstract
Benchmark datasets for network intrusion detection commonly rely on synthetically generated traffic, which fails to reflect the statistical variability and temporal drift encountered in operational environments. This paper introduces MAWIFlow, a flow-based benchmark derived from the MAWILAB v1.1 dataset, designed to enable realistic and reproducible evaluation of anomaly detection methods. A reproducible preprocessing pipeline is presented that transforms raw packet captures into flow representations conforming to the CICFlowMeter format, while preserving MAWILab's original anomaly labels. The resulting datasets comprise temporally distinct samples from January 2011, 2016, and 2021, drawn from trans-Pacific backbone traffic. To establish reference baselines, traditional machine learning methods, including Decision Trees, Random Forests, XGBoost, and Logistic Regression, are compared…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Internet Traffic Analysis and Secure E-voting · Anomaly Detection Techniques and Applications
