Assessing Generalisation Capability of Machine Learning Models for Intrusion Detection
Md Zakir Hossain, Md Ayshik Rahman Khan, Md Rafiqul Islam, Syed Mohammed Shamsul Islam, and Tom Gedeon

TL;DR
This paper evaluates the generalisation ability of machine learning models for intrusion detection across different datasets, revealing significant performance drops in cross-dataset scenarios and emphasizing the need for adaptive models.
Contribution
It provides an empirical analysis of the generalisation gap in ML-based intrusion detection, highlighting the limitations of current models across diverse network environments.
Findings
Random Forest achieved over 95% accuracy on same dataset
Performance dropped below 40% in cross-dataset testing
Significant generalisation gap identified in intrusion detection models
Abstract
The growth of networked and IoT systems has intensified cyber-security threats and exposed the limits of traditional signature-based intrusion detection. Although machine-learning-based intrusion detection systems often report strong benchmark performance, high ac- curacy within a single dataset does not necessarily guarantee reliable performance in unseen network environments. This study investigates the generalisation capability of supervised machine learning models for intrusion detection using UNSW-NB15 and TON_IoT. Random Forest, Logistic Regression, and Naive Bayes were evaluated under same-dataset and cross-dataset settings. Random Forest achieved the strongest same dataset performance, with 95.08% accuracy on UNSW-NB15 and 99.79% on TON_IoT, but performance dropped sharply in cross-dataset testing. When trained on UNSW-NB15 and tested on TON_IoT or vice versa, below 40%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
