The Unreasonable Effectiveness of Address Clustering

Martin Harrigan; Christoph Fretter

arXiv:1605.06369·cs.CR·September 17, 2018

The Unreasonable Effectiveness of Address Clustering

Martin Harrigan, Christoph Fretter

PDF

TL;DR

This paper investigates why address clustering techniques are highly effective in Bitcoin, analyzing factors like address reuse and cluster growth over seven years.

Contribution

It identifies and quantifies key reasons behind address clustering effectiveness, providing insights into Bitcoin's address structure.

Findings

01

Address reuse significantly aids clustering accuracy

02

Super-clusters exhibit high centrality, facilitating entity identification

03

Incremental cluster growth reflects Bitcoin's evolving address network

Abstract

Address clustering tries to construct the one-to-many mapping from entities to addresses in the Bitcoin system. Simple heuristics based on the micro-structure of transactions have proved very effective in practice. In this paper we describe the primary reasons behind this effectiveness: address reuse, avoidable merging, super-clusters with high centrality, and the incremental growth of address clusters. We quantify their impact during Bitcoin's first seven years of existence.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.