Network Threat Detection: Addressing Class Imbalanced Data with Deep Forest

Jiaqi Chen; Rongbin Ye

arXiv:2506.08383·cs.LG·June 11, 2025

Network Threat Detection: Addressing Class Imbalanced Data with Deep Forest

Jiaqi Chen, Rongbin Ye

PDF

Open Access

TL;DR

This paper evaluates machine learning methods for IoT malware detection, focusing on class imbalance issues, and finds that ensemble methods like gcForest with resampling improve detection accuracy.

Contribution

It introduces an empirical analysis of imbalance handling techniques combined with ensemble methods for IoT threat detection, highlighting gcForest's effectiveness.

Findings

01

gcForest with resampling outperforms traditional methods

02

Addressing class imbalance improves detection performance

03

Ensemble methods are effective for IoT malware detection

Abstract

With the rapid expansion of Internet of Things (IoT) networks, detecting malicious traffic in real-time has become a critical cybersecurity challenge. This research addresses the detection challenges by presenting a comprehensive empirical analysis of machine learning techniques for malware detection using the IoT-23 dataset provided by the Stratosphere Laboratory. We address the significant class imbalance within the dataset through three resampling strategies. We implement and compare a few machine learning techniques. Our findings demonstrate that the combination of appropriate imbalance treatment techniques with ensemble methods, particularly gcForest, achieves better detection performance compared to traditional approaches. This work contributes significantly to the development of more intelligent and efficient automated threat detection systems for IoT environments, helping to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Imbalanced Data Classification Techniques · Network Security and Intrusion Detection