Supervised Feature Selection Techniques in Network Intrusion Detection: a Critical Review
Mario Di Mauro, Giovanni Galatro, Giancarlo Fortino, Antonio Liotta

TL;DR
This paper critically reviews supervised feature selection techniques in network intrusion detection, emphasizing recent datasets, various FS approaches, and their impact on performance and resource efficiency.
Contribution
It provides a comprehensive evaluation of recent feature selection methods, including evolutionary techniques, using a new dataset and detailed experimental analysis.
Findings
Feature selection reduces training time without sacrificing accuracy.
Evolutionary techniques effectively identify significant features.
Trade-offs exist between performance and resource consumption.
Abstract
Machine Learning (ML) techniques are becoming an invaluable support for network intrusion detection, especially in revealing anomalous flows, which often hide cyber-threats. Typically, ML algorithms are exploited to classify/recognize data traffic on the basis of statistical features such as inter-arrival times, packets length distribution, mean number of flows, etc. Dealing with the vast diversity and number of features that typically characterize data traffic is a hard problem. This results in the following issues: i) the presence of so many features leads to lengthy training processes (particularly when features are highly correlated), while prediction accuracy does not proportionally improve; ii) some of the features may introduce bias during the classification process, particularly those that have scarce relation with the data traffic to be classified. To this end, by reducing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFeature Selection
