An Ensemble Approach toward Automated Variable Selection for Network Anomaly Detection
Makiya Nakashima, Alex Sim, Youngsoo Kim, Jonghyun Kim, Jinoh Kim

TL;DR
This paper presents an automated variable selection method for network anomaly detection that identifies core features, reducing complexity while maintaining detection performance, demonstrated on public network traffic datasets.
Contribution
The paper introduces a novel ensemble-based approach that automates variable selection by combining well-known methods, addressing challenges in stopping criteria and result variability.
Findings
Identifies a small set of core variables that approximate full-feature performance.
Demonstrates effectiveness on UNSW-NB15 and IDS2017 datasets.
Reduces feature set size without significant loss in detection accuracy.
Abstract
While variable selection is essential to optimize the learning complexity by prioritizing features, automating the selection process is preferred since it requires laborious efforts with intensive analysis otherwise. However, it is not an easy task to enable the automation due to several reasons. First, selection techniques often need a condition to terminate the reduction process, for example, by using a threshold or the number of features to stop, and searching an adequate stopping condition is highly challenging. Second, it is uncertain that the reduced variable set would work well; our preliminary experimental result shows that well-known selection techniques produce different sets of variables as a result of reduction (even with the same termination condition), and it is hard to estimate which of them would work the best in future testing. In this paper, we demonstrate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Anomaly Detection Techniques and Applications · Internet Traffic Analysis and Secure E-voting
