Exploring Network-Wide Flow Data with Flowyager
Said Jawad Saidi, Aniss Maghsoudlou, Damien Foucard, Georgios, Smaragdakis, Ingmar Poese, Anja Feldmann

TL;DR
Flowyager is a system that efficiently summarizes and queries network-wide flow data, significantly reducing response times and storage requirements, thereby enabling rapid analysis for network management and security.
Contribution
We introduce Flowyager, a novel system that creates self-adjusted Flowtrees to summarize flow data, drastically reducing space and transfer needs while supporting fast, structured network-wide queries.
Findings
Flowyager reduces storage and transfer by 75-95% compared to raw data.
It achieves an order of magnitude faster query response times.
Enables interactive, network-wide flow analysis for security and management.
Abstract
Many network operations, ranging from attack investigation and mitigation to traffic management, require answering network-wide flow queries in seconds. Although flow records are collected at each router, using available traffic capture utilities, querying the resulting datasets from hundreds of routers across sites and over time, remains a significant challenge due to the sheer traffic volume and distributed nature of flow records. In this paper, we investigate how to improve the response time for a priori unknown network-wide queries. We present Flowyager, a system that is built on top of existing traffic capture utilities. Flowyager generates and analyzes tree data structures, that we call Flowtrees, which are succinct summaries of the raw flow data available by capture utilities. Flowtrees are self-adjusted data structures that drastically reduce space and transfer requirements,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
