The Risk-Utility Tradeoff for IP Address Truncation
Martin Burkhart, Daniela Brauckhoff, Martin May, Elisa Boschi

TL;DR
This paper investigates the balance between preserving privacy and maintaining data utility in IP address truncation, revealing that truncation effectively prevents host identification but can significantly impair anomaly detection, especially for internal addresses.
Contribution
It provides a formal analysis of the privacy-utility tradeoff in IP address truncation using real network traces and entropy-based metrics, highlighting the differential impact on internal versus external addresses.
Findings
Truncation prevents host identification effectively.
Data utility for anomaly detection degrades with increased truncation.
Internal address utility is lost with minimal truncation, external addresses are more resilient.
Abstract
Network operators are reluctant to share traffic data due to security and privacy concerns. Consequently, there is a lack of publicly available traces for validating and generalizing the latest results in network and security research. Anonymization is a possible solution in this context; however, it is unclear how the sanitization of data preserves characteristics important for traffic analysis. In addition, the privacy-preserving property of state-of-the-art IP address anonymization techniques has come into question by recent attacks that successfully identified a large number of hosts in anonymized traces. In this paper, we examine the tradeoff between data utility for anomaly detection and the risk of host identification for IP address truncation. Specifically, we analyze three weeks of unsampled and non-anonymized network traces from a medium-sized backbone network to assess data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Security and Resilience · Internet Traffic Analysis and Secure E-voting · Privacy-Preserving Technologies in Data
