Monitoring Data Requests in Decentralized Data Storage Systems: A Case Study of IPFS
Leonhard Balduf, Sebastian Henningsen, Martin Florian, Sebastian Rust,, Bj\"orn Scheuermann

TL;DR
This paper introduces a passive monitoring method for IPFS, revealing network size, activity, content popularity, and privacy vulnerabilities, with implications for privacy protection and network analysis.
Contribution
The paper presents a novel passive monitoring approach for IPFS that enables large-scale data request analysis and uncovers privacy risks, which was not previously feasible.
Findings
The methodology successfully monitors a significant portion of IPFS nodes.
Insights into network size, activity, and content popularity are obtained.
Privacy vulnerabilities, including node surveillance, are demonstrated.
Abstract
Decentralized data storage systems like the Interplanetary Filesystem (IPFS) are becoming increasingly popular, e. g., as a data layer in blockchain applications and for sharing content in a censorship-resistant manner. In IPFS, data is hosted by an open set of nodes and data requests are broadcast to connected peers in addition to being routed via a distributed hash table (DHT). In this paper, we present a passive monitoring methodology that exploits this design for obtaining data requests from a significant and upscalable portion of nodes. Using an implementation of our approach for the IPFS network and data collected over a period of fifteen months, we demonstrate how our methodology enables profound insights into, among other things: the size of the IPFS network, activity levels and structure, and content popularity distributions. We furthermore present that our methodology can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Peer-to-Peer Network Technologies · Advanced Data Storage Technologies
