Mapping the Internet: Modelling Entity Interactions in Complex Heterogeneous Networks
Simon Mandlik, Tomas Pevny

TL;DR
The paper introduces HMill, a versatile framework for modeling complex heterogeneous networks, enabling flexible machine learning on diverse data types, with theoretical justification and successful cybersecurity applications.
Contribution
HMill is a unified, flexible framework that extends multi-instance learning and includes a universal approximation theorem, facilitating modeling of heterogeneous, hierarchical, and missing data.
Findings
Achieved comparable performance to specialized methods in cybersecurity tasks
Extended universal approximation theorem for models in the framework
Demonstrated versatility across three diverse cybersecurity applications
Abstract
Even though machine learning algorithms already play a significant role in data science, many current methods pose unrealistic assumptions on input data. The application of such methods is difficult due to incompatible data formats, or heterogeneous, hierarchical or entirely missing data fragments in the dataset. As a solution, we propose a versatile, unified framework called `HMill' for sample representation, model definition and training. We review in depth a multi-instance paradigm for machine learning that the framework builds on and extends. To theoretically justify the design of key components of HMill, we show an extension of the universal approximation theorem to the set of all functions realized by models implemented in the framework. The text also contains a detailed discussion on technicalities and performance improvements in our implementation, which is published for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Network Security and Intrusion Detection · Advanced Graph Neural Networks
