Imbalanced Malware Images Classification: a CNN based Approach

Songqing Yue; Tianyang Wang

arXiv:1708.08042·cs.CV·February 22, 2022·109 cites

Imbalanced Malware Images Classification: a CNN based Approach

Songqing Yue, Tianyang Wang

PDF

Open Access

TL;DR

This paper introduces a weighted softmax loss function for CNNs to improve malware image classification performance on imbalanced datasets, validated through extensive experiments.

Contribution

It proposes a novel weighted softmax loss that effectively addresses class imbalance in CNN-based malware image classification.

Findings

01

Improved classification accuracy on imbalanced malware datasets.

02

The weighted loss function enhances performance across different CNN architectures.

03

Empirical results demonstrate the effectiveness of the proposed method.

Abstract

Deep convolutional neural networks (CNNs) can be applied to malware binary detection via image classification. The performance, however, is degraded due to the imbalance of malware families (classes). To mitigate this issue, we propose a simple yet effective weighted softmax loss which can be employed as the final layer of deep CNNs. The original softmax loss is weighted, and the weight value can be determined according to class size. A scaling parameter is also included in computing the weight. Proper selection of this parameter is studied and an empirical option is suggested. The weighted loss aims at alleviating the impact of data imbalance in an end-to-end learning fashion. To validate the efficacy, we deploy the proposed weighted loss in a pre-trained deep CNN model and fine-tune it to achieve promising results on malware images classification. Extensive experiments also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning

MethodsSoftmax