Less is More: A privacy-respecting Android malware classifier using   Federated Learning

Rafa G\'alvez; Veelasha Moonsamy; Claudia Diaz

arXiv:2007.08319·cs.CR·June 17, 2021

Less is More: A privacy-respecting Android malware classifier using Federated Learning

Rafa G\'alvez, Veelasha Moonsamy, Claudia Diaz

PDF

1 Repo

TL;DR

LiM is a privacy-preserving Android malware classifier using Federated Learning, achieving high accuracy and robustness while keeping user app data local and secure against attacks.

Contribution

This paper introduces LiM, a novel federated learning framework for malware detection that maintains user privacy and is resilient to poisoning and inference attacks.

Findings

01

Cloud F1 score of 95%

02

Clients have perfect recall with minimal false positives

03

Robust against poisoning and inference attacks

Abstract

In this paper we present LiM ("Less is More"), a malware classification framework that leverages Federated Learning to detect and classify malicious apps in a privacy-respecting manner. Information about newly installed apps is kept locally on users' devices, so that the provider cannot infer which apps were installed by users. At the same time, input from all users is taken into account in the federated learning process and they all benefit from better classification performance. A key challenge of this setting is that users do not have access to the ground truth (i.e. they cannot correctly identify whether an app is malicious). To tackle this, LiM uses a safe semi-supervised ensemble that maximizes classification accuracy with respect to a baseline classifier trained by the service provider (i.e. the cloud). We implement LiM and show that the cloud server has F1 score of 95%, while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

brechtvandervliet/ResistancePoisoningFederatedMalwareClassifier
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.