Attack vs Benign Network Intrusion Traffic Classification
M. Andrecut

TL;DR
This paper demonstrates that a simple nearest neighbor classifier can effectively detect network intrusions using the CSE-CIC-IDS2018 dataset, challenging the need for complex machine learning models.
Contribution
It shows that a straightforward nearest neighbor approach achieves comparable results to complex models in intrusion detection, emphasizing simplicity and robustness.
Findings
Nearest neighbor classifier performs well on IDS data
Simple methods can rival complex deep learning models
Highlights over-engineering in current ML approaches
Abstract
Intrusion detection systems (IDS) are used to monitor networks or systems for attack activity or policy violations. Such a system should be able to successfully identify anomalous deviations from normal traffic behavior. Here we discuss the machine learning approach to building an anomaly-based IDS using the CSE-CIC-IDS2018 dataset. Since the publication of this dataset a relatively large number of papers have been published, most of them presenting IDS architectures and results based on complex machine learning methods, like deep neural networks, gradient boosting classifiers, or hidden Markov models. Here we show that similar results can be obtained using a very simple nearest neighbor classification approach, avoiding the inherent complications of training such complex models. The advantages of the nearest neighbor algorithm are: (1) it is very simple to implement; (2) it is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Anomaly Detection Techniques and Applications · Internet Traffic Analysis and Secure E-voting
