Group & Reweight: A Novel Cost-Sensitive Approach to Mitigating Class Imbalance in Network Traffic Classification
Wumei Du, Dong Liang, Yiqin Lv, Xingxing Liang, Guanlin Wu, Qi Wang,, Zheng Xie

TL;DR
This paper introduces a group and reweight strategy inspired by distributionally optimization to address severe class imbalance in network traffic classification, improving prediction accuracy and safety.
Contribution
It proposes a novel clustering and reweighting method based on a Stackelberg game framework to better handle minority malicious classes.
Findings
Improves classification performance on imbalanced network traffic data
Reduces negative effects of class imbalance on decision boundaries
Enhances safety in risk-sensitive network applications
Abstract
Internet services have led to the eruption of network traffic, and machine learning on these Internet data has become an indispensable tool, especially when the application is risk-sensitive. This paper focuses on network traffic classification in the presence of severe class imbalance. Such a distributional trait mostly drifts the optimal decision boundary and results in an unsatisfactory solution. This raises safety concerns in the network traffic field when previous class imbalance methods hardly deal with numerous minority malicious classes. To alleviate these effects, we design a group & reweight strategy for alleviating class imbalance. Inspired by the group distributionally optimization framework, our approach heuristically clusters classes into groups, iteratively updates the non-parametric weights for separate classes, and optimizes the learning model by minimizing reweighted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsElectricity Theft Detection Techniques · Traffic Prediction and Management Techniques · Imbalanced Data Classification Techniques
