What is Normal? A Big Data Observational Science Model of Anonymized Internet Traffic
Jeremy Kepner, Hayden Jananthan, Michael Jones, William Arcand, David, Bestor, William Bergeron, Daniel Burrill, Aydin Buluc, Chansup Byun, Timothy, Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Piotr, Luszczek, Lauren Milechin, Chasen Milner

TL;DR
This paper presents a scalable observational science model of anonymized Internet traffic using high-performance graph libraries and supercomputers to define normal behavior for anomaly detection while preserving privacy.
Contribution
It introduces a novel approach combining graph-based models and high-performance computing to analyze anonymized Internet traffic for normal behavior modeling.
Findings
Successfully synthesized low-parameter models of anonymized traffic
Demonstrated scalability with high-performance graph libraries
Enhanced privacy preservation in traffic analysis
Abstract
Understanding what is normal is a key aspect of protecting a domain. Other domains invest heavily in observational science to develop models of normal behavior to better detect anomalies. Recent advances in high performance graph libraries, such as the GraphBLAS, coupled with supercomputers enables processing of the trillions of observations required. We leverage this approach to synthesize low-parameter observational models of anonymized Internet traffic with a high regard for privacy.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Big Data Technologies and Applications · Internet Traffic Analysis and Secure E-voting
