Decision-forest voting scheme for classification of rare classes in   network intrusion detection

Jan Brabec; Lukas Machlica

arXiv:2107.11862·cs.LG·July 27, 2021

Decision-forest voting scheme for classification of rare classes in network intrusion detection

Jan Brabec, Lukas Machlica

PDF

TL;DR

This paper presents a Bayesian ensemble method for decision forests that improves detection of rare classes in network intrusion detection, maintaining high precision and increasing detection rates without additional parameter tuning.

Contribution

The novel Bayesian aggregation approach effectively handles class imbalance in decision forests without extra parameters, enhancing network intrusion detection performance.

Findings

01

Maintains over 94% precision in detection

02

Increases detection rate by approximately 7%

03

Effectively handles large-scale network data

Abstract

In this paper, Bayesian based aggregation of decision trees in an ensemble (decision forest) is investigated. The focus is laid on multi-class classification with number of samples significantly skewed toward one of the classes. The algorithm leverages out-of-bag datasets to estimate prediction errors of individual trees, which are then used in accordance with the Bayes rule to refine the decision of the ensemble. The algorithm takes prevalence of individual classes into account and does not require setting of any additional parameters related to class weights or decision-score thresholds. Evaluation is based on publicly available datasets as well as on an proprietary dataset comprising network traffic telemetry from hundreds of enterprise networks with over a million of users overall. The aim is to increase the detection capabilities of an operating malware detection system. While we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.