Ensemble Classifier Design Tuned to Dataset Characteristics for Network Intrusion Detection
Zeinab Zoghi, Gursel Serpen

TL;DR
This paper introduces a dataset-specific ensemble classifier for network intrusion detection that addresses class imbalance and overlap issues, significantly improving performance on the UNSW-NB15 dataset.
Contribution
It proposes new algorithms to mitigate class overlap and tunes ensemble methods for dataset characteristics, enhancing intrusion detection accuracy.
Findings
Outperforms existing models on UNSW-NB15 dataset
Effectively handles class imbalance with tailored ensemble methods
Improves classification accuracy for both binary and multi-category tasks
Abstract
Machine Learning-based supervised approaches require highly customized and fine-tuned methodologies to deliver outstanding performance. This paper presents a dataset-driven design and performance evaluation of a machine learning classifier for the network intrusion dataset UNSW-NB15. Analysis of the dataset suggests that it suffers from class representation imbalance and class overlap in the feature space. We employed ensemble methods using Balanced Bagging (BB), eXtreme Gradient Boosting (XGBoost), and Random Forest empowered by Hellinger Distance Decision Tree (RF-HDDT). BB and XGBoost are tuned to handle the imbalanced data, and Random Forest (RF) classifier is supplemented by the Hellinger metric to address the imbalance issue. Two new algorithms are proposed to address the class overlap issue in the dataset. These two algorithms are leveraged to help improve the performance of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Anomaly Detection Techniques and Applications · Internet Traffic Analysis and Secure E-voting
MethodsBalanced Selection
