Automatic Clustering of a Network Protocol with Weakly-Supervised Clustering
Tobias Schrank, Franz Pernkopf

TL;DR
This paper introduces an automatic, weakly-supervised clustering method to create behavioral abstractions of network protocols, demonstrated on TLS, reducing manual effort and matching reference models with minimal labeled data.
Contribution
The paper presents a novel weakly-supervised clustering approach for automatic protocol abstraction, reducing reliance on manual domain expertise.
Findings
The method accurately reproduces reference abstractions with minimal labeled data.
It effectively reduces the vocabulary size of protocol messages.
The approach outperforms manual abstraction in consistency and efficiency.
Abstract
Abstraction is a fundamental part when learning behavioral models of systems. Usually the process of abstraction is manually defined by domain experts. This paper presents a method to perform automatic abstraction for network protocols. In particular a weakly supervised clustering algorithm is used to build an abstraction with a small vocabulary size for the widely used TLS protocol. To show the effectiveness of the proposed method we compare the resultant abstract messages to a manually constructed (reference) abstraction. With a small amount of side-information in the form of a few labeled examples this method finds an abstraction that matches the reference abstraction perfectly.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Software Testing and Debugging Techniques
