Machine Learning for Network-based Intrusion Detection Systems: an Analysis of the CIDDS-001 Dataset
Jos\'e Carneiro, Nuno Oliveira, Norberto Sousa, Eva Maia, Isabel, Pra\c{c}a

TL;DR
This paper compares the effectiveness of using the Class label versus the AttackType label in training machine learning models for network intrusion detection using the CIDDS-001 dataset.
Contribution
It introduces a comparison between two labels, Class and AttackType, for training ML models, highlighting the potential of AttackType for intrusion detection.
Findings
AttackType label shows promise for training ML models.
Random Forest outperforms K-Nearest Neighbours in this context.
Using AttackType can provide reliable results for intrusion detection.
Abstract
With the increasing amount of reliance on digital data and computer networks by corporations and the public in general, the occurrence of cyber attacks has become a great threat to the normal functioning of our society. Intrusion detection systems seek to address this threat by preemptively detecting attacks in real time while attempting to block them or minimizing their damage. These systems can function in many ways being some of them based on artificial intelligence methods. Datasets containing both normal network traffic and cyber attacks are used for training these algorithms so that they can learn the underlying patterns of network-based data. The CIDDS-001 is one of the most used datasets for network-based intrusion detection research. Regarding this dataset, in the majority of works published so far, the Class label was used for training machine learning algorithms. However,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
