Large-scale Mobile App Identification Using Deep Learning
Shahbaz Rezaei, Bryce Kroencke, Xin Liu

TL;DR
This paper introduces a deep learning approach for mobile app identification from encrypted traffic, achieving high accuracy and addressing challenges like ambiguous flows and early prediction requirements.
Contribution
It presents a novel CNN+LSTM model that improves app identification accuracy, especially for ambiguous flows, and provides insights into data leakage through SSL/TLS.
Findings
Achieves 84-98% accuracy in identifying 80 popular apps.
Uses early packet payloads for real-time classification.
First to identify source apps for ambiguous flows.
Abstract
Many network services and tools (e.g. network monitors, malware-detection systems, routing and billing policy enforcement modules in ISPs) depend on identifying the type of traffic that passes through the network. With the widespread use of mobile devices, the vast diversity of mobile apps, and the massive adoption of encryption protocols (such as TLS), large-scale encrypted traffic classification becomes increasingly difficult. In this paper, we propose a deep learning model for mobile app identification that works even with encrypted traffic. The proposed model only needs the payload of the first few packets for classification, and, hence, it is suitable even for applications that rely on early prediction, such as routing and QoS provisioning. The deep model achieves between 84% to 98% accuracy for the identification of 80 popular apps. We also perform occlusion analysis to bring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
