On Defending Against Label Flipping Attacks on Malware Detection Systems

Rahim Taheri; Reza Javidan; Mohammad Shojafar; Zahra Pooranian; Ali; Miri; Mauro Conti

arXiv:1908.04473·cs.LG·June 17, 2020

On Defending Against Label Flipping Attacks on Malware Detection Systems

Rahim Taheri, Reza Javidan, Mohammad Shojafar, Zahra Pooranian, Ali, Miri, Mauro Conti

PDF

TL;DR

This paper introduces novel deep learning-based defenses against label flipping attacks in Android malware detection within IoT systems, demonstrating significant accuracy improvements over existing methods.

Contribution

It proposes two CNN-based semi-supervised defense algorithms, LSD and CSD, specifically designed to counter Silhouette Clustering-based Label Flipping Attacks in malware detection.

Findings

01

Defense algorithms improve accuracy up to 19% over state-of-the-art.

02

Effective label correction demonstrated on three Android datasets.

03

Varying features and parameters enhances detection robustness.

Abstract

Label manipulation attacks are a subclass of data poisoning attacks in adversarial machine learning used against different applications, such as malware detection. These types of attacks represent a serious threat to detection systems in environments having high noise rate or uncertainty, such as complex networks and Internet of Thing (IoT). Recent work in the literature has suggested using the $K$ -Nearest Neighboring (KNN) algorithm to defend against such attacks. However, such an approach can suffer from low to wrong detection accuracy. In this paper, we design an architecture to tackle the Android malware detection problem in IoT systems. We develop an attack mechanism based on Silhouette clustering method, modified for mobile Android platforms. We proposed two Convolutional Neural Network (CNN)-type deep learning algorithms against this \emph{Silhouette Clustering-based Label…

Tables4

Table 1. Table 1 : Important abbreviations used in this paper.

Notations	Description
$A M L$	Adversarial Machine Learning
$S S L$	semi-supervised learning
$L S D$	Label-based Semi-supervised Defense
$C S D$	clustering-based Semi-supervised Defense
$K S S D$	KNN-based Semi-Supervised Defense
$G A N$	Generative Adversarial Network
$C N N$	Convolutional Neural NetworK
$L P$	Label Propagation
$L S$	Label Spreading
$R I$	Rand Index
$M I$	Mutual Information
$F M I$	Fowlkes-Mallows Index

Table 2. Table 2 : AUC and FNR Comparisons in percent ( % percent \% ) for presented algorithms in various features, datasets in two test scenarios: WFS and WoFS. WFS= with feature selection; WoFS= without feature selection.

Other ML Metrics			Datasets
Ratio ( $%$ )			Drebin				Contagio				Genome
Algorithms		File Type	WoFS		WFS		WoFS		WFS		WoFS		WFS
			FNR	AUC	FNR	AUC	FNR	AUC	FNR	FNR	FNR	AUC	FNR	AUC
		Permission	10.71	82.64	11.71	60.56	6.67	59.06	23.79	60.76	5.95	96.21	3.46	68.80
		API	16.59	83.22	12.43	80.78	29.70	94.35	25.75	92.91	32.29	74.40	25.86	70.24
Attack	SCLFA	Intents	49.21	62.97	7.94	42.28	45.83	95.87	3.21	43.55	45.22	94.26	0.31	48.32
		Permission	2.17	93.90	2.33	70.77	1.65	80.32	0.81	61.64	1.29	93.89	2.05	68.65
		API	6.62	90.77	5.65	88.44	6.59	95.34	11.13	94.03	7.40	96.59	12.15	55.23
	LSD	Intents	10.20	91.97	0.43	43.70	6.28	89.42	0.75	50.07	7.27	91.82	0.53	50.41
		Permission	1.44	95.08	3.22	73.98	0.94	93.18	0.84	91.36	1.33	97.06	1.58	69.07
		API	2.53	98.40	2.04	98.28	1.47	95.05	0.42	91.39	0.84	98.42	0.65	96.83
	CSD	Intents	2	93.74	0.54	45.60	1.75	93.18	0.55	50.31	1.10	94.12	0.40	51.43
		Permission	48.95	47.00	11.06	70.93	47.91	0.54	25.28	61.19	33.62	55.49	3.51	69.14
Defenses		API	11.11	87.37	12.46	78.92	16.96	93.80	20.78	95.58	27.26	59.77	24.78	70.76
	GANX taheri2019can	Intents	87.35	8.96	6.99	43.50	33.46	57.75	2.10	71.60	26.36	60.84	0.57	83.29
		Permission	4.95	91.17	11.58	70.59	3.53	61.88	25.19	90.94	5.60	90.91	3.72	68.57
		API	10.78	86.27	12.46	78.92	19.81	94.45	23.27	94.25	19.91	72.48	24.73	71
	KSSD paudice2018label	Intents	39.10	70.93	6.91	42.39	35.11	89.02	2.35	44.58	41.79	90.04	0.30	50.51

Table 3. Table 3 : Computational complexity comparisons in seconds ( s 𝑠 s ) for presented algorithms in various features, datasets in two test scenarios: WFS and WoFS. WFS= with feature selection; WoFS= without feature selection.

Computational Complexity			Datasets
Time (s)			Drebin		Contagio		Genome
Algorithms		File Type	WoFS	WFS	WoFS	WFS	WoFS	WFS
		Permission	140.09	4.04	87.66	3.56	130.10	3.11
		API	7.14	4.71	4.84	3.88	4.21	3.74
Attack	SCLFA	Intents	150.99	3.83	209.89	2.87	106.07	2.92
		Permission	385.79	101.16	417.62	107.81	348.62	106.02
		API	123.91	114.64	117.35	112.75	109.87	105.38
	LSD	Intents	963.97	105.17	747.98	96.81	501.85	108.10
		Permission	148.15	11.50	118.77	9.51	123.45	9.16
		API	21.76	15.77	17.22	13.27	14.56	12.77
Defenses	CSD	Intents	281.83	11.26	235.24	9.21	198.63	11.42
		Permission	95.90	5.20	83.91	4.15	90.83	5.16
		API	9.95	7.59	8.53	6.46	8.41	6.42
	KSSD paudice2018label	Intents	210.99	5.17	206.77	4.12	146.55	5.12
		Permission	425.13	211.64	471.33	194.55	394.65	176.38
		API	94.23	67.45	86.56	64.75	75.34	57.14
	GAN taheri2019can	Intents	515.41	276.54	495.32	209.21	436.97	196.45

Table 4. Table A.1 : Testing the proposed methods (attacks and defenses) using training classification algorithms tested on API, Permission and Intent features in various datasets. No-A= No attack algorithm; RF= random forest; NN= neural network; SVM= support vector machine; DT= decision tree.

Drebin
		RF		SVM		DT		NN		CNN
	Algs.	Acc	FPR	Acc	FPR	Acc	FPR	Acc	FPR	Acc	FPR
	No-A	98.00	3.81	98.40	2.08	97.85	2.25	97.78	4.58	98.45	2.70
	SCLFA	83.03	28.81	83.37	27.58	82.24	27.63	82.75	29.92	83.42	28.08
API	LSD	90.92	17.91	91.29	16.46	90.74	16.56	90.67	18.87	91.34	17.01
	CSD	97.39	3.63	97.78	1.90	97.23	2.07	97.16	4.40	97.83	2.52
	KSSD paudice2018label	82.27	31.56	82.61	30.38	82.06	30.42	81.99	32.70	82.66	30.87
	GANX taheri2019can	82.82	28.31	82.90	28.38	82.54	27.72	84.67	25.79	83.30	29.07
	No-A	86.61	39.86	87.02	38.31	87.71	39.02	87.23	38.36	87.07	38.76
	SCLFA	74.97	59.97	75.38	58.52	76.23	58.44	75.59	58.88	75.42	58.88
Permission	LSD	85.85	45.10	86.26	43.59	85.90	43.04	86.47	43.71	86.30	44.01
	CSD	86.59	40.03	87.00	38.49	87.52	39.46	87.21	38.54	87.04	38.93
	KSSD paudice2018label	79.31	44.41	79.72	42.88	80.70	42.78	79.93	43.00	79.77	43.31
	GANX taheri2019can	79.86	45.69	80.00	40.89	83.23	43.22	82.46	39.68	80.34	43.03
	No-A	78.26	85.81	78.53	85.37	77.76	86.39	77.93	85.62	78.38	86.10
	SCLFA	68.75	87.04	69.01	86.68	68.25	87.54	68.41	86.87	68.86	87.30
Intents	LSD	75.50	87.32	75.76	86.93	74.99	87.85	75.16	87.13	75.61	87.59
	CSD	77.98	86.18	78.24	85.74	77.48	86.76	77.64	85.98	78.10	86.47
	KSSD paudice2018label	70.68	88.66	70.94	88.28	70.17	89.20	70.34	98.94	70.79	88.94
	GANX taheri2019can	70.53	92.07	71.22	86.13	72.56	88.16	67.96	98.90	72.82	89.21
Contagio
		RF		SVM		DT		NN		CNN
	Algs.	Acc	FPR	Acc	FPR	Acc	FPR	Acc	FPR	Acc	FPR
	No-A	98.45	2.70	98.45	6.72	98.27	6.38	97.11	10.36	97.95	12.90
	SCLFA	75.97	9.52	75.79	9.28	74.98	11.17	75.48	15.84	75.62	12.32
API	LSD	89.28	7.84	89.10	7.54	88.60	8.02	88.78	14.08	88.92	10.60
	CSD	98.39	12.61	98.21	12.46	97.42	14.08	97.89	19.06	98.04	15.47
	KSSD paudice2018label	78.44	7.28	78.25	6.96	77.67	8.31	77.94	99.72	78.08	10.03
	GANX taheri2019can	79.11	9.27	78.60	0.86	80.49	9.46	80.72	99.67	80.43	7.84
	No-A	98.30	12.02	98.30	8.61	98.87	10.73	97.60	18.10	97.95	15.02
	SCLFA	72.09	65.40	72.09	62.61	72.15	68.14	71.39	73.93	71.74	69.67
Permission	LSD	92.83	65.40	92.83	62.61	93.29	68.14	92.13	73.93	92.48	69.67
	CSD	98.07	12.61	98.07	9.20	98.63	11.36	97.37	18.71	97.72	15.62
	KSSD paudice2018label	70.89	64.81	82.30	8.22	72.43	42.89	70.19	73.31	70.54	69.07
	GANX taheri2019can	70.86	74.68	82.52	7.38	75.11	43.89	73.93	56.29	70.54	68.69
	No-A	90.43	92.81	90.43	92.81	91.75	75.07	90.37	85.87	96.84	29.13
	SCLFA	80.49	98.11	80.49	98.11	81.80	87.50	80.51	93.75	86.89	61.55
Intents	LSD	89.06	93.24	89.06	93.24	90.43	77.00	89.01	86.90	95.47	35.77
	CSD	90.11	94.03	90.11	94.03	91.43	76.27	90.05	87.02	96.52	30.54
	KSSD paudice2018label	81.92	96.98	81.92	96.98	83.24	86.06	81.93	92.53	88.33	59.18
	GANX taheri2019can	82.09	90.09	82.27	92.58	81.51	89.36	84.16	75.34	91.49	49.46
Gnome
		RF		SVM		DT		NN		CNN
	Algs.	Acc	FPR	Acc	FPR	Acc	FPR	Acc	FPR	Acc	FPR
	No-A	99.37	4.08	99.16	4.96	98.59	5.04	98.89	8.02	99.52	2.94
	SCLFA	71.97	53.88	71.76	55.37	71.55	50.84	71.49	54.58	72.12	54.20
API	LSD	84.66	83.45	84.45	86.62	86.46	48.55	84.18	81.48	84.81	85.51
	CSD	98.83	6.94	98.62	7.85	97.03	8.02	92.76	46.67	98.98	5.88
	KSSD paudice2018label	73.18	52.77	72.97	54.31	76.16	55.70	72.70	53.57	73.33	53.07
	GANX taheri2019can	73.40	59.39	73.33	44.49	79.41	60.96	75.81	34.86	73.22	53.42
	No-A	94.72	54.69	94.93	51.63	93.91	59.24	94.54	62.66	94.57	57.14
	SCLFA	92.80	55.51	93.01	52.44	91.99	60.08	92.62	63.52	92.65	57.98
Permission	LSD	94.12	55.83	94.34	52.70	93.31	60.52	93.94	64.04	93.97	58.37
	CSD	94.57	55.10	94.78	52.03	93.76	59.66	94.39	63.09	94.42	57.56
	KSSD paudice2018label	92.53	55.92	92.74	52.85	91.72	60.50	92.35	63.95	92.38	58.40
	GANX taheri2019can	93.22	62.34	93.10	43.60	91.98	59.36	92.71	47.73	92.85	57.59
	No-A	93.19	84.03	92.86	80.45	93.28	83.59	93.04	82.58	99.22	8.78
	SCLFA	88.42	91.22	88.09	88.86	88.15	94.07	88.27	90.27	94.45	43.03
Intents	LSD	91.21	89.84	90.88	86.69	91.54	90.75	91.06	88.56	97.24	25.00
	CSD	91.33	87.82	91.00	84.76	91.53	88.20	91.18	86.58	97.36	24.44
	KSSD paudice2018label	90.55	88.86	90.22	86.05	90.13	90.60	90.40	96.07	96.58	30.88
	GANX taheri2019can	88.96	94.51	90.58	79.02	90.25	87.26	90.64	95.89	96.55	30.25

Equations24

D = {(x_{i}, y_{i}) \in (X, Y)}, i = 1, \dots, n

D = {(x_{i}, y_{i}) \in (X, Y)}, i = 1, \dots, n

S = {(x_{k}, y_{k})}, k = 1, \dots, m

S = {(x_{k}, y_{k})}, k = 1, \dots, m

L_{i} = {(x_{i}, y_{i}), (x_{i}, ∣1 - y_{i} ∣), S V > 0 otherwise

L_{i} = {(x_{i}, y_{i}), (x_{i}, ∣1 - y_{i} ∣), S V > 0 otherwise

I (X_{1}, X 2) = x_{1} \in X_{1} \sum x_{2} \in X_{2} \sum p_{(X_{1}, X_{2})} (x_{1}, x_{2}) lo g_{2} (\frac{p _{(X_{1}, X_{2})} ( x _{1} , x _{2} )}{p _{X_{1}} ( x _{1} ) p _{X_{2}} ( x _{2} )})

I (X_{1}, X 2) = x_{1} \in X_{1} \sum x_{2} \in X_{2} \sum p_{(X_{1}, X_{2})} (x_{1}, x_{2}) lo g_{2} (\frac{p _{(X_{1}, X_{2})} ( x _{1} , x _{2} )}{p _{X_{1}} ( x _{1} ) p _{X_{2}} ( x _{2} )})

H M = 1 - \frac{H ( Y _{T} ∣ Y _{P R} )}{H ( Y _{T} )}

H M = 1 - \frac{H ( Y _{T} ∣ Y _{P R} )}{H ( Y _{T} )}

A cc = \frac{Ω + χ}{Ω + χ + Λ + ν}

A cc = \frac{Ω + χ}{Ω + χ + Λ + ν}

P r ec i s i o n = \frac{Ω}{Ω + Λ}

P r ec i s i o n = \frac{Ω}{Ω + Λ}

R ec a l l = \frac{Ω}{Ω + ν}

R ec a l l = \frac{Ω}{Ω + ν}

\small\begin{split}{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\pgfsys@color@gray@stroke{0}\pgfsys@color@gray@fill{0}F1-Score}=\frac{1}{\frac{1}{Recall}+\frac{1}{Precision}}=\\ 2*\frac{Precision*Recall}{Precision+Recall}\end{split}

\small\begin{split}{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\pgfsys@color@gray@stroke{0}\pgfsys@color@gray@fill{0}F1-Score}=\frac{1}{\frac{1}{Recall}+\frac{1}{Precision}}=\\ 2*\frac{Precision*Recall}{Precision+Recall}\end{split}

F P R = \frac{Λ}{Λ + χ}

F P R = \frac{Λ}{Λ + χ}

A U C = \frac{1}{2} (\frac{Ω}{Ω + Λ} + \frac{χ}{χ + Λ})

A U C = \frac{1}{2} (\frac{Ω}{Ω + Λ} + \frac{χ}{χ + Λ})

F N R = \frac{ν}{ν + Ω}

F N R = \frac{ν}{ν + Ω}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFeature Selection

Full text

∎

11institutetext: R. Taheri and R. Javidan 22institutetext: Department of Computer Engineering and Information Technology, Shiraz University of Technology, Shiraz, Iran

22email: {r.taheri, javidan}@sutech.ac.ir 33institutetext: M. Shojafar, Z. Pooranian and M. Conti 44institutetext: SPRITZ, Department of Mathematics,

University of Padua, Padua, Italy

44email: [email protected]; [email protected]

44email: {zahra, conti}@math.unipd.it 55institutetext: A. Miri 66institutetext: Department of Computer Science,

Ryerson University, Toronto, Canada

66email: [email protected]

On Defending Against Label Flipping Attacks on Malware Detection Systems

Rahim Taheri

Reza Javidan

Mohammad Shojafar

Zahra Pooranian

Ali Miri

Mauro Conti

(Received: 23 July 2019 / Revised: 22 February 2020/ Accepted: 04 March 2020)

Abstract

Label manipulation attacks are a subclass of data poisoning attacks in adversarial machine learning used against different applications, such as malware detection. These types of attacks represent a serious threat to detection systems in environments having high noise rate or uncertainty, such as complex networks and Internet of Thing (IoT). Recent work in the literature has suggested using the $K$ -Nearest Neighboring (KNN) algorithm to defend against such attacks. However, such an approach can suffer from low to miss-Classification rate Accuracy. In this paper, we design an architecture to tackle the Android malware detection problem in IoT systems. We develop an attack mechanism based on Silhouette clustering method, modified for mobile Android platforms. We proposed two Convolutional Neural Network (CNN)-type deep learning algorithms against this Silhouette Clustering-based Label Flipping Attack (SCLFA). We show the effectiveness of these two defense algorithms - Label-based Semi-supervised Defense (LSD) and clustering-based Semi-supervised Defense (CSD) - in correcting labels being attacked. We evaluate the performance of the proposed algorithms by varying the various machine learning parameters on three Android datasets: Drebin, Contagio, and Genome and three types of features: API, Intent, and Permission. Our evaluation shows that using random forest feature selection and varying ratios of features can result in an improvement of up to 19% Accuracy when compared with the state-of-the-art method in the literature.

Keywords:

Adversarial Machine Learning (AML), semi-supervised defense (SSD), malware detection, adversarial example, label flipping attacks, deep learning.

1 Introduction

Machine learning (ML) algorithms have the ability to accurately predict patterns in data. However, some of the data can come from uncertain and untrustworthy sources. Attackers can exploit this vulnerability as part of what is known as Adversarial Machine Learning (AML) attacks. Poisoning attacks or data poisoning attacks are a subclass of AML attacks, in which attackers inject malicious data into the training set in order to compromise the learning process, and effect the algorithm performance in a targeted manner. Label flipping attacks are a special type of data poisoning, in which the attacker can control labels assigned to a fraction of training points. Label flipping attacks can significantly diminishes the performance of the system, even if the attacker’s capabilities are otherwise limited. Recent work in AML looks into effectiveness of poisoning attacks in degrading performance of popular classification algorithms, such as support vector machines (SVM) zhou2012adversarial , embedded features selection methods xiao2015feature ; zhang2016adversarial , neural networks ganin2016domain , and deep learning systems papernot2016distillation . Most attacks in the literature assume attackers can manipulate both features and labels associated with the poisoning data. However, sometimes the attacker’s capabilities are limited to manipulating labels, and he is only able to flip the labels to fool the ML classifier. These types of attacks are known as flipping attacks. Deep neural networks (DNNs) have gained significant success in classifying well labeled data. However, label flip type poisoning attacks can reduce the Accuracy of these algorithms yang2017generative . Therefore, there is a need for alternative methods for training DNNs that takes label flipping attacks into account. Such methods should be able to identify and correct mislabeled samples or re-weight the data terms in the loss function according to the extracted label.

There are a number of work in the literature focused on identifying and dealing with poisoning attacks. For example, an algorithmic method evaluates the impact of each training sample on the performance of learning algorithms bhagoji2017dimensionality . Although this method is effective in some cases, it cannot be generalized to the large dataset. Among other defensive mechanisms, the outlier detection is used to identify and remove suspicious samples. But, this method has a limited performance (i.e., Accuracy) against label flipping attacks paudice2018label . Another category of related works mainly focus on learning strategies that can be applied on flip labels. Such solutions are divided into two categories. In the first group, it can directly learn flipped labels, whilst in the second group, it can focus on an extra set of clean data. In the first case, the label flipping module is used to identify correctly labeled data natarajan2013learning ; xiao2015support , and to modify the changes on the labels to reset the data terms in the loss function. Performance of this technique is significantly impacted by its label cleaning Precision and its rate of flip sample estimation. In the second group of methods, an additional set of clean data is used to guide the learning agent through flipped data ren2018learning . Despite promising results, both groups of methods have a common default. They try to fix the flipped labels, or they re-weight the terms for data points. This default will inevitably cause errors for some data points.

Motivated by these considerations, in this paper, we consider the binary classification for sampling and analysis of Android malware. We only assume the weakest capability for the attacker. That is, we assume that the attacker has no perfect knowledge about the learning algorithm, the loss function optimized by the system, or the initial the training data and a set of features used by the learning algorithm. We show that having the system identifying and retraining the wrong label, and using our proposed Semi-Supervised (SS) approach to training will result in better results. To this end, we suggest a solution that covers the existing data points that are mislabeled and improves the Accuracy of the classification algorithm. To do so, we present an architecture for learning flipped data. Then, we identify a small part of the mislabeled training set, whose labels are likely to be correct, and the flipped labels associated with other data are ignored. Afterward, we train a deep neural network in a SS manner based on selected data.

1.1 Contributions

In this context, several natural questions are arising, such as: How can we define attack based on label flipping algorithm which can fool the classifier? Is it possible to design an enhanced ML model to improve system security by presenting some secure algorithms against a given label flipping attack? How can we tune and test the countermeasure solutions to deal with label flipping attack? The answer to these queries is the goal of this paper. More in detail, the goal of the paper summarizes as follows: First, we rank the data points within each class and then hold the label for the points that have higher rankings. If no clean set is available, the ranking is based on the multi-way classification neural network, which is trained from the original training dataset. In fact, a binary classifier is learned that, while clean labels are available, separates data containing clean labels and flipped labels. Second, we apply a temporary ensemble for semi-supervised deep neural network training. Hence, our original contributions are as follows:

•

We present an architecture for learning flipped data which reflects our main focus in the malware detection system.

•

We propose a label flipping poisoning technique to attack the Android malware detection based on deep learning: where an algorithm is proposed for crafting efficient prototypes so that the attacker can deceive the classification algorithm. In this technique, we use Silhouette clustering to find an appropriate sample to flip its label.

•

We introduce a DL-based semi-supervised approach against label flipping attacks in the malware detection system called LSD, which uses label propagation and label spreading algorithms along with CNNs to predict the correct value of labels for the training set.

•

We implement a countermeasure method based on clustering algorithms as a defense mechanism. It is a DL-based semi-supervised approach against label flipping attacks in the malware detection system that improves the detection Accuracy of the compromised classifier. In this approach we use four clustering metrics and validation data to re-labeled poisoned labels.

•

We conduct our experiments on two scenarios on three real Android datasets using three feature types compared to the cutting-edge method and deeply analyze the trade-offs that emerge. The source code of the paper is available in Github Teheri2020NCAA .

To best of our knowledge, none of the previous works in literature has conducted a similar analysis. The closet paper to our method is KNN-based Semi-Supervised Defense (KSSD) paudice2018label , in which the authors have entailed KNN strategy to relabel samples by considering the distance between them. However, the work in paudice2018label is tailored to the relabeling of samples, they are unable to specify some similar samples that may be malware and benign and may mislabel the features of benign sample due to low distance of samples. Unlike the paudice2018label , in this paper we explicitly tackle the poisoning samples located far from the decision boundary and relabel them. Also the defense method presented in paudice2018label is unable to distinguish overlapping areas of two classes and cannot correctly label the poisoning samples located there while our defense methods imposes the model to tackle such data points and relabeling them.

1.2 Organization of the paper

We organize the rest of the paper as follows. Section 2 overview the related works. Section 3 details the problem definition, the presented architecture, and the related components. Section 4 presents our proposed attack model inspired by AML architecture and reports the proposed defense strategies against the raised attack. We evaluate the performance of the algorithms in Section 5. In Section 6 we detail the results of the experiment and provide some open discussion regarding our method. Section 7 presents conclusions and future work. Table 1 shows the important abbreviations used in this paper.

2 Related work

In this section, we classify the related work in the literature into two different defense classes: i) we will cover defense approaches that try to correct labels in Section 2.1, and ii) defense strategies that ignore poisoned labels and adopt semi-supervised learning methods to protect the model against attacks are then covered in Section 2.2. Hence, we draw conceptual relationships and delineate the most recent defense strategies applied to tackle the label flipping attack and identify relevant major alternatives for comparison.

2.1 Defense algorithms against poisoning attacks

The problem of classification with label noise - mislabeling in class variable - is an active area of research. The paper frenay2014comprehensive gives a comprehensive overview of both the theoretical and applied aspects of this area.Label flipping mechanism is a solution to cover label noise in the classifiers bootkrajang2012label . This method can model the overall label flipping probability. However, it is lack of considering individual specific characteristics in label noise. In laishram2016curie , the authors create a lightweight method called Curie to protect SVM Classier against poisoning attacks. The preliminary idea behind this method is to distinguish the suspicious data points and remove them outside the dataset before starting the learning step of the SVM algorithm. In other words, Curie’s algorithm flips labels in the training dataset to defend SVM classifiers against poisoning attacks. They cluster the data in the feature space and try to calculate the average distance of each point from the other points in the same cluster with related weight and train model and test in some datasets. They present that their defense method is able to correct 95% of samples in the training dataset. Additionally, the authors in munoz2017towards describe a poisoning algorithm to solve the bi-level optimization problem based on back-gradient optimization maclaurin2015gradient . The proposed algorithm applies automatic differentiation technique to compute the gradient in the optimization problem. This algorithm using gradient method to resolve the optimization problem which takes several computational time, it can pose challenges in complex networks such as neural networks and deep learning. Thus, they apply a novel technique named back-gradient optimization to allow computing the gradient of interest in a more computationally efficient and stable manner to shape their ML model. Authors in wang2018data explicitly investigate data poisoning attacks for the semi-online setting, unlike other works which mostly based on the offline setting. The work in shafahi2018poison argues that it is possible to perform targeted attacks on specific testing data without declining the overall performance of classifier along with any control of adversary over the labeling of training data. - The methodology proposed in baracaldo2018detecting is suitable to identify and remove poisonous data in IoT systems. This method, mainly, exploits data provenance to identify manipulated data before the training step to improve the performance of classification. Compared to our method, the defense method presented in baracaldo2018detecting cannot correctly label the poisoning samples while our defense methods imposes the model to tackle such data points and relabeling them. The work in bootkrajang2014learning focus on building an automatic robust multiple kernel-based logistic regression classifier against poisoning attacks without applying any cross-validation. Despite the fact that proposed classifier may improve performance and learning speed; it does suffer from lack of any theoretical guarantees. To address this issue, they extend their method and entail new structure to resist the negative effect of random label noise as well as a wide range of non-random label noises bootkrajang2016generalised .

2.2 Semi-supervised learning defense algorithms

Another active area of research is the one dealing with learning from unlabeled data. The semi-supervised learning approach, along with applying unlabeled data to learn better models is particularly relevant to our work. The semi-supervised approaches include multiview learning like dong2018tri , co-training ren2018learning ; xia20183d , graph-based methods like iscen2019label , and semi-supervised ML solutions like SVM li2014towards , and our proposed work (DL-based semi-supervised solution). These approaches try tackle that many successful learning algorithms need access to a large set of labeled data. To address this issue, i.e. lack of availablity of labeled data, a combination of tri-training with a deep model wee used in dong2018tri to build Tri-Net, which can use massive set of unlabeled data to help to learn with limited labeled data. The semi-supervised deep learning model generates three modules to exploit unlabeled data by considering model initialization, diversity augmentation, and pseudo-label editing. Graph-based transduction approach that works through the propagation of few labels, called label propagation, was used in iscen2019label to improve the classification performances and obtain estimated labels. This method consists of two steps. In the first step, the classifier trains through labeled and the predicts pseudo-labeled. In the second step, the nearest neighbor graph constructs based on the previous trained classifier. A limitation to this approach is that practically graph models are often mis-specified. However, this could potentially be overcomed by employing highly expressive model families like neural networks kaiser2015neural . Hence, in S3VM method li2014towards , the authors adopt SVM solution to finding the flipped label examples in a dataset and improve the safeness of the semi-supervised support vector machines (S3VM). They indicate that performance of their method is not statistically significantly worse than the solution shaped with labeled data alone. The major limitation of this method is that it is not easy to use such method for large amounts of noisy samples and outliers and it exponentially reduce ML performance.

3 System model and proposed architecture

In this section, we first provide a formal definition our problem (see Section 3.1). Then, in Section 3.2, we introduce the proposed Android malware detection architecture used in the paper. In particular, Fig. 1 will describe the components of the proposed architecture.

3.1 Problem definition

Consider the datasets as follows.

[TABLE]

where $n$ is the number of malware samples. If $x_{i}$ has the $j$ feature, we have $x_{ij}=1$ . Otherwise $x_{ij}=0$ , and $X\subseteq\{0,1\}^{k}$ - a $k$ -dimensional space. The variable $y$ represents the label of the samples with $y_{i}\in\{0,1\}$ and the $D$ set has an unknown distribution on $X\times Y$ . We assume the training set is defined as follows.

[TABLE]

where $S$ is the label set. The flipping attack label aims to find a collection such as $P$ containing samples in $S$ so that when their labels are flipped, it minimizes the desired target for the attacker. For simplicity, we assume that the attacker’s goal is to maximize the loss function which we define it as $L(w,(x_{j},y_{j}))$ .

3.2 Proposed architecture

In this section, we present our architecture to tackle the Android malware detection problem in IoT systems (see Fig. 1).

In Fig. 1, we present a general scheme of our proposed architecture and the proposed attack and defense algorithms which use for Android applications. In this architecture, we assume a complex set of IoT devices (i.e., IoT systems) which are communicating with each other, represented by the yellow oval in the figure. We assume that some of the IoT devices are using Android OS platforms. We also assume that an attacker can get access to some of the IoT devices. Hence, he can manipulate the data they transferring to each other. As a result, the data traffic of each Android data can include those from malware apps, represented by the black Android app symbol in our figure. Each Android app, whether malware and benign, presents as a vector of different features with various labels. ML algorithms exposed to adversary attacks can add a variety of perturbations to data to fool ML algorithms. Hence, in this architecture, an adversary can get access to the dataset and flip the labels by adding some perturbation of existing labels. Our feature selection component gives the ability to select the choice of features. We then generate a binary vector of each Android app and input the result to the ML model. A final component of our architecture is the detection system composed of the ML model and our proposed defense algorithms. Our architecture can increase the robustness of our detection system against flipping label attacks, and increase the Accuracy of malware/benign classifications. In the following section, we explore our attack and defense algorithms.

4 Proposed attack and defensive solutions

In this section, the proposed classification algorithm used in the paper is described first in Section4.1. We then describe our attack strategy, inspired by Silhouette clustering method in Section 4.2. Section 4.3 presents our two defense solutions against the attack proposed in the previous section. Finally, we report the computational complexity of our strategies in Section 4.4.

4.1 Classification algorithm

In this paper, we incorporate a deep CNN to classify the binary samples. We adopt the overfitting method to find out how good our dataset size is. Shift invariant or CNN is a multilayer perceptrons strategy to tackle the fully connected neurons in each layer and help to prone the over-fitting data and can include more complex patterns. To do so, we try to classify our data using a training set and then repeat the classification using cross-validation. If we increase the data size, it gives better results in CNN classification processing.

Fig. 2 presents the proposed CNN architecture for the classification algorithm. In this figure, we can see that we apply three sequential layers of one-dimensional convolution (Conv-1D) that has 16, 32, and 64 filters. In each of these layers, we have kernel-size with value 2 and stride with value 2. We apply Maxpooling between the convolution layers to prevent overfitting by reducing the computational load, memory, and number of parameters. Each Maxpooling layer creates four pool size with two strides. After applying three convolutional layers, we adopt a Flattened layer and a Dense layer. In the Dense layer, we use $Adam$ optimizer and $Sigmoid$ activation function to shape the classification algorithm and the out of the Dense layer is the classified data.

4.2 Attack strategy: Silhouette Clustering-based Label Flipping Attack (SCLFA)

In this subsection, we apply Silhouette clustering method to flip the labels. We name this attack silhouette Clustering-based Label Flipping Attack (SCLFA). Silhouette clustering is a type of clustering technique in which can interpret and validate the consistency of data clusters. Silhouette provides a concise visual presentation object classifications. This technique defines a measurement called silhouette value (SV) that expresses the self cluster similarity or cohesion of per object comported to other clusters or separation, which is between [-1,1]. If the silhouette value is one, it presents well matching of the object to its own cluster and is less likeness to other neighboring clusters. If the majority of the objects in a cluster have high SVs, it indicates that the clusters objects and the clustering is appropriately configured. We utilize a Euclidean distance method to calculate the SV in this paper. We define the label flipping attack (LFA) as follows:

Definition 1

LFA in SCLFA: LFA is a type of attack that the attacker tries to use some algorithms to modify the label of features and changes the interval of each sample in a cluster. In this paper, we use the silhouette clustering algorithm to implement LFA. To put it simply, in SCLFA, we assign an interval [-1,1] for each sample, which indicates whether the sample is in the correct cluster. If the silhouette value (SV) is negative, it means that the selected sample is a good candidate for flipping the label, and according to the silhouette algorithm, it is definitely belonging to another cluster. Hence, we change the label of such sample. Let $L_{i}$ be the label of the $i$ -th sample out of $n$ samples in the dataset. Thus, we can write it as eq. (3):

[TABLE]

Algorithm 1 presents the label flipping poisoning attack.

Description of the Algorithm 1. In this algorithm, we present the proposed method, SDLFA, for the flipping label of the training sample. This method is based on the K-means clustering algorithm. In this way, we first create a model based on the K-means algorithm that divides the $X\_train$ samples into two clusters and predicts the label for each sample (lines 1-2). Then, in line 3, we calculate the $Silhouette$ values for samples and predicted labels for the samples. As previously stated, values close to 1 indicate that the sample is fitted in the appropriate cluster, and as the values of Silhouette are less than 1 and close to -1, it means that the sample is clustered incorrectly. In the proposed method, we flipped the label of samples that have a Silhouette value less than zero. In this way, we probably have chosen the examples that have the potential to be in the other cluster (lines 4-8).

4.3 Defensive Strategies

In this subsection, we discuss these countermeasures against the label flipping attack. In detail, we describe Label-based Semi-supervised Defense (LSD) and Clustering-based Semi-supervised Defense (CSD) which are presented in Sections 4.3.1 and 4.3.2, respectively. In this paper, we assume our data are only partially labeled. Our defense strategies begins by investigating which validation data in training samples may have been flipped. It would then predict new labels for these data and replace their labels. Fig. 3 shows the overview of the semi-supervised learning (SSL) model for both defense strategies.

4.3.1 LSD Defense

In this section, we design LSD algorithm to give a priority between semi-monitoring learning (SML) methods. In other words, we adopt validation data as inputs of SML algorithms to predict the label for each sample and then rank the predicted labels. The goal of the LSD algorithm is to find the samples for which the labels in the flipped training set are likely to have the correct values. Then, we need to give the selected data and its labels to the SSL algorithm. We need to create a validation set to monitor the training process and select the suitable parameters. That is, in the LSD method, we first rank the data points within each class, and then hold the label for the points that have the highest rankings. If no clean set is available, ranking is applied which is designed based on the multi-way classification neural network. Hence, Ranking is trained from the original training dataset. In fact, in this defense mechanism, we try to learn a binary classifier while clean labels are available. Then, we separate data containing clean labels and flipped labels. Formally speaking, in this defense strategy, in the first stage, we apply the Label Propagation (LP) algorithm to assign the labels to unlabeled data points. Then, in the next stage, we use Label Spreading (LS) to minimize the noises happen in labeling the samples. In LSD method, we plan to design a method which works like an ensemble learning such that it uses propagation models to predict labels for flipping. In this way, we provide a two-stage framework for learning flipped labels. In the following, we describe LP and LS.

•

Label Propagation (LP). LP is a type of semi-supervised ML algorithm can give a label to the unlabeled sample data. First, LP gives labels a small dataset of samples and make classifications. In other work, LP aims to propose the labels to the unlabeled data points. That is, LP helps to find the community structure in real complex networks aviles2019beyond . LP compared to the other practical methods in literature has much lower processing time and could support apriori information needed about the network structure, and it does not require any knowledge of data point and samples before propagation. However, LP could produce several solutions for each set of data points.

•

Label Spreading (LS). LS algorithm is a type of propagation method that can apply the normalized graph Laplacian and soft clamping in an affinity matrix to influence on the labels. It also can diminish the regularization properties of a loss function to and make it robust against the noise LP . LS algorithm repeats on the modified version of a graph of data points and can normalize the edge weights by computing the normalized graph Laplacian matrix.

LP and LS algorithms create on a kernel of the system in which positively effect on the performance of the algorithm and enhance the chance of scalability of the problem. To be precise, and as an example, the RBF kernel can generate a fully connected graph that can demonstrate a dense matrix. Such big size matrix, in each iteration, could join with the cost performance full matrix multiplication calculation and results in increasing the time complexity, which causes a problem for scalable case studies. In this paper, we fix the problem by utilizing LP and LS algorithms on a KNN kernel system which provides much more memory-friendly sparse matrix and can exponentially save on execution latency.

In the first stage of LSD algorithm, we use validation set to train the LP and LS algorithms. Then, we use these algorithms to predict labels of training set. At the same time we train the CNN classifier with the validation data and predict new labels for training set samples. In the second stage, we use voting between all available labels, i.e., LP output, Label Spreading, CNN predicted labels and poisoned labels.

In the second stage of LSD algorithm, we apply a temporary ensembling for semi-supervised Deep Neural Network training. Then, we present a semi-supervised two-stage algorithm for training flipped labels, which include two main components. We discover and select some samples from the labeled training set, for which there are strong indications that their labels are correct. Afterward, we aim to learn a semi-supervised deep neural network that only uses the selected labels from the first previous stage. Finally, the ML model network can easily classify previously unseen test data. We summarize the proposed LSD countermeasure algorithm in Algorithm 2.

Description of the Algorithm 2. It presents the semi-supervised defense, which is based on Label estimation. As seen in this algorithm, in lines 3-5, the label spreading algorithm is applied, which is used to find labels of training data. The label spreading algorithm is trained using validation data and then created a model is used to predict labels of training data. Similarly, lines 6-8 use the Label Propagation algorithm to predict training data labels. This algorithm, like the Label Spreading algorithm, is a semi-supervised algorithm. In lines 9 and 10 of this algorithm, convolutional neural network as the third part of the ensemble learning approach is used, which is trained with validation data and is used to predict the training data label. The final part of the LSD method is the voting between the results of the three methods described and the poisoned label, which is the result of voting as the label for training samples.

4.3.2 CSD Defense

The main idea behind this approach is to use clustering techniques to correct flipped labels. As each of the clustering methods has its specific measure, in this method it is suggested to use the voting between the label determined by different clustering methods for determines the label of the flipped samples. Hence, we use four indices to analyze the Accuracy of our generated clusters and the predicted one and identify the most likely adversarial examples and flip their labels.

Description of Algorithm 3. In this algorithm, we explain the CSD method. In lines 1-3 of this algorithm, we use the proposed CNN model and validation data and predict the labels of the training data. Lines 4 to 7, the algorithm describes four cluster metrics, namely RI, MI, HM, and FMI and compute their values. Each of these metrics is a measure for the Accuracy of clustering. The main idea behind this approach is that the training samples are labeled in such a way that the mentioned measure does not differ significantly from the values calculated from the validation data. Therefore, in lines 8-16, we add one sample of the training data to the validation dataset, calculate the values of the clustering with four metrics , and compare them with the base values. If the difference is less than 0.1 (i.e., we consider as a threshold), then we consider the sample to be properly labeled. As a result, the output of this algorithm is the labeled sample which can be used as a validation data and selected sample for training the ML model.

The indices are defined as below.

•

Rand Index (RI): Rand measure/index is a statistical index to calculate the similarity between two data clustering rand1971objective . It is a value between zero and one such that zero indicating that two sets of clustered data do not have any pair point and one indicating that the data clustering is the same. Also, RI can be used to adjust a group for elements that we called them adjusted Rand index. In other words, RI is a metric of the Accuracy of two sets of data points, which represents the frequency of occurrence of total pairs. Formally, speaking, RI presents the probability of how can we randomly select two pair $X_{1}$ and $X_{2}$ in two partitions of the same big set.

•

Mutual Information (MI): MI, or information gain is a measure to realize the amount of information and dependency between two separate variables by observing them MI . It is a type of entropy of a random variable that can understand the joint distribution of a pair data points which calculates by the product of the marginal distribution of those pair samples. Since the data we are dealing with are fallen in the group of discrete data with discrete distribution, we can calculate the $\mathcal{I}$ MI of two jointly discrete random variable $X_{1}$ and $X_{2}$ as follows:

[TABLE]

where $p_{(X_{1},X_{2})}$ is a joint probability mass function for the two samples of $X_{1}$ and $X_{2}$ and $p_{X_{1}}$ is a marginal probability of sample $X_{1}$ and $p_{X_{2}}$ is a marginal probability of sample $X_{2}$ .

•

Homogeneity Metric (HM): This metric uses for validating the data points which are members of a single class. HM is independent of being changed the score value of data point when a permutation of the class or labels are applied hirakawa2005adaptive . We can define HM values as $\mathcal{HM}$ as follows:

[TABLE]

where $\mathcal{HM}$ can be between 0 and 1. Note that low values of $\mathcal{HM}$ explains a low homogeneity, vice-versa. If we have a sample data $Y$ , we define $Y_{PR}$ , $Y_{T}$ are the predicted and the corrected value for that sample, hence, $H(Y_{T})$ is HM value for that sample when it is correctly placed and predicted to be placed in one single class, respectively. Besides, the $\frac{H(Y_{T}|Y_{PR})}{H(Y_{T})}$ indicates that the predicted sample is not placed correctly in a single class. We aim to approach this fraction smaller and reach it to zero $(\mathcal{HM}\rightarrow 1)$ . We can achieve this goal when we reduce the knowledge of $Y_{PR}$ and diminish the uncertainty of $Y_{T}$ that results in the fraction above become smaller, and we have HM around 1.

•

Fowlkes-Mallows Index (FMI): Fowlkes–Mallows Index (FMI) metric is a popular metric to understand the similarity between two generated clusters, whether hierarchical or benchmark classification clusters guo2019clustering . The higher similarity between two clusters (created cluster and the benchmark one) indicates higher FMI values. FMI is an accurate metric uses to evaluate the unrelated data and also is reliable even with added noises to the data results.

4.4 Computational Complexity

In following section, we evaluate computational complexity analysis on the presented attack and defensive methods. Assume that the number of samples in $X\_train$ and $X\_validation$ are $n$ and $m$ , respectively. We list the computational complexity of the methods. So, we have

•

**Time Complexity of SCLFA Attack

Focusing on SCLFA, the computation of all possible configurations in line 1-2 of Algorithm 1 creates a model based on the $K$ -means method and predicts the correct $n$ training samples, results in $\mathcal{O}(n^{2}\times k)$ . Since in this method, $k=2$ , the time complexity is in the order of $\mathcal{O}(n^{2})$ . In line 3 of this algorithm, Silhouette values are computed for $n$ training data samples, which has a complexity of $\mathcal{O}(n^{2})$ . Lines 4-8 of the algorithm include a for loop that performs the correction of the $m$ validating labels and has a complexity of $\mathcal{O}(m)$ . Overall, the computational complexity of the Algorithm 1 is in the order of $\mathcal{O}(n^{2})+\mathcal{O}(n^{2})+\mathcal{O}(m)$ = $\mathcal{O}(n^{2})$ , $\forall\ n\gg m$ .

**

•

**Time Complexity of LSD Defense

Focusing on LSD, the computation of Algorithm 2 directly relates to the LS method, which has a complexity of $\mathcal{O}(n)$ . Similarly, in lines 6-8, the model is based on the LP algorithm, which has a complexity of $\mathcal{O}(n)$ . Then, lines 9 and 10 present CNN model creating, according to he2015convolutional , which has a computational complexity of all convolutional layers. CNN computational complexity is $\mathcal{O}(\Sigma{{}^{d}}_{i=1}{n_{(l-1)}}\times s^{2}_{l}\times m^{2}_{l})$ , where $l$ is the index of a convolutional layer; $d$ is the depth (number of convolutional layers); $n_{l}$ is the width or the number of filters in the $l_{th}$ layer– $n_{(l−1)}$ is the number of input channels of the $l$ -th layer; $s_{l}$ is the spatial size (length) of the filter and $m_{l}$ is the spatial size of the output feature of CNN which has a time complexity in the order of $\mathcal{O}(n^{3})$ . Then, we performs voting between results that has a complexity of $\mathcal{O}(1)$ (line 11). Overall, the computational complexity of LSD defense algorithm is $\mathcal{O}(n)+\mathcal{O}(n)+\mathcal{O}(n^{3})+\mathcal{O}(1)$ = $\mathcal{O}(n^{3})$ .

**

•

**Time Complexity of CSD Defense

Focusing on CSD, the computation of Algorithm 3 relies on CNN model construction based validation data (lines 1-2). Then, we predict the label for training data samples based on this generated ML model. Therefore, the computational complexity of this part is in the order of $\mathcal{O}(n^{3})$ . Focusing on the RI, MI , HM and FMI clustering metric calculations, they have a complexity of $\mathcal{O}(n)$ (lines 4-7). Then, we calculate the values of these parameters for $m$ samples. Hence, the complexity of this loop of the CSD algorithm is in the order of $\mathcal{O}(n\times m)$ (lines 8-16). As a result, the overall computational complexity of CSD defense method is $\mathcal{O}(n\times m)+\mathcal{O}(n^{3})+\mathcal{O}(n)$ = $\mathcal{O}(n^{3})$ , $\forall\ n>>m$ .

**

5 Experimental evaluation

In this section, we report the results of our proposed attack and defense algorithms in different scenarios: with feature selection consideration (WFS) and without feature selection consideration (WoFS). Given the two scenarios, we conduct our experiments on our attack (SCLFA) and defense algorithms (LSD and CSD) against KNN-based Semi-Supervised Defense (KSSD) paudice2018label . The source code of the paper is available in Github Teheri2020NCAA .

5.1 Simulation setup

We describe the test metrics, datasets, features, classification parameter, and comparison defense algorithm below.

5.1.1 Test metrics

To provide a comprehensive evaluations of our attack and defense algorithms, we use the following indices: Accuracy, Precision, Recall, False positive rate (FPR), True negative rate (TNR), miss rate (FNR), F1-score, and area under cover (AUC):

•

Accuracy: Accuracy metric is defined in:

[TABLE]

where $\Omega$ is true positive; $\chi$ is true negative; $\Lambda$ is false positive, and $\nu$ is false negative metrics.

•

Precision: Precision is the fraction of relevant samples between the retrieved samples which is shown in

[TABLE]

•

Recall: The Recall is expressed in

[TABLE]

•

F1-Score: This metric defines as a harmonic mean of Precision and Recall which is defined as

[TABLE]

•

False Positive Rate (FPR): This metric represents a ratio between the number of negative events incorrectly classified as positive (false positives) and the total number of actual negative events. This metric is described in equation (10):

[TABLE]

•

Area Under Curve (AUC): AUC measures the trade off between misclassification rate and FPR. This metrics can be calculated as (11):

[TABLE]

•

False Negative Rate (FNR): This metric is a method for determining the case that the condition does not hold, while in fact it does. In this work, we also called it miss rate. This metrics can be calculated as (12):

[TABLE]

5.1.2 Datasets

Our experiments utilized the following three datasets:

•

Drebin dataset: This dataset is an Android example collection that we can apply directly. The Drebin dataset includes 118,505 applications/samples from various Android sources arp2014drebin .

•

Contagio dataset: it consists of 11,960 mobile malware samples and 16,800 benign samples Contagio .

•

Genome dataset: This dataset is an Android example which is supported by the National Science Foundation (NSF) project of the United States. From August 2010 to October 2011, the authors collected about 1,200 samples of Android malware from different categories as a genome dataset jiang2012dissecting .

5.1.3 Features

In this paper, we consider various malicious sample features like Permissions, APIs and Intents. We summarize them as follows:

•

Permission: Permission is a essential profile of an Android application (apk) file that includes information about the application. The Android operating system processes these Permission files before installation.

•

API: API feature monitors various calls to APIs on an Android OS, such as sending SMS or accessing a user’s location.

•

Intent: Intent feature applies to represent the communication between different components which is known as a medium.

5.1.4 Parameter setting

We rank the features to better manage the huge amount of features using the RandomForestRegressor algorithm. Then, we repeat our experiments for 300 manifest features with higher ranks to determine the optimal number of features for modification in each method. In each test, we randomly consider 60% of the dataset as training samples, 20% as validation samples and 20% as testing samples. We run our experiments on an 8-core Intel Core i7 with speed 4 GHz with 16 GB RAM on an OS Win10 64-bit.

5.1.5 Comparison of defense algorithms

We compare our proposed algorithms to defend against label flipping attacks with KNN-based Semi-Supervised Defense (KSSD) paudice2018label and GAN-based Defense taheri2019can . The comparison results show that our proposed methods are more robust in detecting Flipping Label attacks. In the KSSD method, authors adopt $K$ nearest neighbor (KNN) method to mitigate the effect of label flipping attacks. A relabeling mechanism for suspected malicious malware is suggested. The KNN algorithm uses the training set to assign a label to each sample. The aim is to ensure the homogeneity of the label between the close examples, especially in areas that are far from the decision boundary. In the training set, authors first select $K$ nearest neighbors using the Euclidean distance. Then, if the fraction of the data points that are among the most commonly enclosed labels in $K$ are equal or greater than the threshold of $t$ with $0.5\leq t\leq 1$ they select them. The training sample available in the $K$ nearest neighbor is relabeled with the most common label. Given that we only have two types of labels in detecting malware, they assign the dominant label in $K$ to the nearest neighbor to the sample. Algorithm 4 presents the KSSD defense.

We indicate that poisoning sample points that are far from the decision boundary are likely to be relabeled, and reduce the negative performance consequences on the classification algorithm. Although the algorithm gains validation of genuine points at the same time, i.e., in areas where the two classes overlap (especially for values of t close to 0.5), we can have a similar amount of the correct points that are labeled in two classes, and it confirms that the KSSD label correction solutions presented in Algorithm 4 must be the same for the two classes. Therefore, this type labeling shall not considerably influenced the classification algorithm.

Another comparison made in this paper is the GAN-based defense presented in taheri2019can . Algorithm 5.1.5 illustrates the proposed method in this study. This algorithm works by generating new samples to train the machine learning model again. Specifically, in this paper, we use the GAN as a synthetic data generator set. GAN has two functions called Generator and Discriminator. The former one can modify the less likely malware samples. To do so, in the training phase, it selects one random feature from the highest ranked features with zero value. Then, it changes the selected feature value to one to generate new sample. In the latter function, the GAN use this function as a classifier to predict the class variable. It modifies the features until the discriminator function is cheated and labels such a sample among the benign samples. Besides, we gather the wrongly estimated malware samples into a synthetic data generator set. Besides, we use 80% of the synthetic data generator set with the training dataset to update the AML model. We use the remaining synthetic data generator samples (i.e., 20% of the data samples) with the test dataset to test/analyze the classification. It is found that the proposed methods even outperform the GAN-based method, since the proposed GAN is only flipping-focused research with respect to the important features of decision making, while the proposed methods in this paper are based on the value of labels.

Bibliography38

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1) Contagio dataset. http://contagiominidump.blogspot.com/ (2020). [Online; accessed 22-February-2020]
2(2) Label propagation. https://scikit-learn.org/stable/modules/label_propagation.html (2020). [Online; accessed 22-February-2020]
3(3) Mutual information. https://nlp.stanford.edu/IR-book/html/htmledition/mutual-information-1.html (2020). [Online; accessed 22-February-2020]
4(4) Arp, D., Spreitzenbarth, M., Hubner, M., Gascon, H., Rieck, K., Siemens, C.: Drebin: Effective and explainable detection of android malware in your pocket. In: Ndss, vol. 14, pp. 23–26 (2014)
5(5) Aviles-Rivero, A.I., Papadakis, N., Li, R., Alsaleh, S.M., Tan, R.T., Schonlieb, C.B.: Beyond supervised classification: Extreme minimal supervision with the graph 1-laplacian. ar Xiv preprint ar Xiv:1906.08635 (2019)
6(6) Baracaldo, N., Chen, B., Ludwig, H., Safavi, A., Zhang, R.: Detecting poisoning attacks on machine learning in iot environments. In: 2018 IEEE International Congress on Internet of Things (ICIOT), pp. 57–64. IEEE (2018)
7(7) Bhagoji, A.N., Cullina, D., Mittal, P.: Dimensionality reduction as a defense against evasion attacks on machine learning classifiers. ar Xiv preprint ar Xiv:1704.02654 (2017)
8(8) Bootkrajang, J.: A generalised label noise model for classification in the presence of annotation errors. Neurocomputing 192 , 61–71 (2016)