Data Analytics-enabled Intrusion Detection: Evaluations of ToN_IoT Linux Datasets
Nour Moustafa, Mohiuddin Ahmed, Sherif Ahmed

TL;DR
This paper introduces the ToN IoT datasets, a comprehensive collection of heterogeneous data sources from IoT, operating systems, and network traffic, designed to evaluate AI-enabled security applications.
Contribution
It presents a novel distributed testbed architecture for collecting Linux datasets from audit traces, integrating edge, fog, and cloud layers with SDN and NFV control.
Findings
Datasets effectively capture legitimate and security events.
Data analysis shows high fidelity and reliability of security event features.
The datasets support training and validation of advanced AI security solutions.
Abstract
With the widespread of Artificial Intelligence (AI)- enabled security applications, there is a need for collecting heterogeneous and scalable data sources for effectively evaluating the performances of security applications. This paper presents the description of new datasets, named ToN IoT datasets that include distributed data sources collected from Telemetry datasets of Internet of Things (IoT) services, Operating systems datasets of Windows and Linux, and datasets of Network traffic. The paper aims to describe the new testbed architecture used to collect Linux datasets from audit traces of hard disk, memory and process. The architecture was designed in three distributed layers of edge, fog, and cloud. The edge layer comprises IoT and network systems, the fog layer includes virtual machines and gateways, and the cloud layer includes data analytics and visualization tools connected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
