Exploring Unsupervised Learning Methods for Automated Protocol Analysis
Arijit Dasgupta, Yi-Xue Yan, Clarence Ong, Jenn-Yue Teo, Chia-Wei Lim

TL;DR
This paper evaluates various unsupervised learning methods for automated protocol analysis, introduces a novel feature extraction approach, and demonstrates its robustness and superior performance across diverse datasets.
Contribution
It proposes a comprehensive framework for evaluating and optimizing unsupervised clustering methods in automated protocol analysis, including a new hybrid approach that outperforms existing techniques.
Findings
The hybrid approach outperformed in 7 of 9 datasets.
It outperformed NETZOB in all datasets.
The automated parameter selection improved performance.
Abstract
The ability to analyse and differentiate network protocol traffic is crucial for network resource management to provide differentiated services by Telcos. Automated Protocol Analysis (APA) is crucial to significantly improve efficiency and reduce reliance on human experts. There are numerous automated state-of-the-art unsupervised methods for clustering unknown protocols in APA. However, many such methods have not been sufficiently explored using diverse test datasets. Thus failing to demonstrate their robustness to generalise. This study proposed a comprehensive framework to evaluate various combinations of feature extraction and clustering methods in APA. It also proposed a novel approach to automate selection of dataset dependent model parameters for feature extraction, resulting in improved performance. Promising results of a novel field-based tokenisation approach also led to our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Internet Traffic Analysis and Secure E-voting · Network Packet Processing and Optimization
MethodsAdaptive Pseudo Augmentation
