Unified Host and Network Data Set
Melissa J. M. Turcotte, Alexander D. Kent, Curtis Hash

TL;DR
This paper introduces a large, anonymized data set from an operational enterprise network at Los Alamos National Laboratory, aiming to advance cybersecurity research by providing valuable real-world data.
Contribution
It presents a comprehensive, anonymized data set from an operational environment, addressing the scarcity of realistic cybersecurity data for research.
Findings
Provides a large, anonymized operational network data set
Facilitates new cybersecurity research and analysis
Encourages other organizations to release similar data
Abstract
The lack of data sets derived from operational enterprise networks continues to be a critical deficiency in the cyber security research community. Unfortunately, releasing viable data sets to the larger com- munity is challenging for a number of reasons, primarily the difficulty of balancing security and privacy concerns against the fidelity and utility of the data. This chapter discusses the importance of cyber secu- rity research data sets and introduces a large data set derived from the operational network environment at Los Alamos National Laboratory. The hope is that this data set and associated discussion will act as a catalyst for both new research in cyber security as well as motivation for other organizations to release similar data sets to the community.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation and Cyber Security · Network Security and Intrusion Detection · Software System Performance and Reliability
