The Shape of Money Laundering: Subgraph Representation Learning on the Blockchain with the Elliptic2 Dataset
Claudio Bellei, Muhua Xu, Ross Phillips, Tom Robinson, Mark Weber, Tim, Kaler, Charles E. Leiserson, Arvind, Jie Chen

TL;DR
This paper introduces Elliptic2, a large dataset of labeled subgraphs from Bitcoin transactions, enabling graph neural network analysis of money laundering patterns for improved AML detection.
Contribution
The paper presents a new large-scale dataset and software tools for subgraph representation learning, specifically tailored for anti-money laundering in cryptocurrency networks.
Findings
Early experimental results show promising classification accuracy.
The dataset enables modeling of illicit activity shapes in blockchain networks.
Practical value demonstrated for AML and forensic analytics.
Abstract
Subgraph representation learning is a technique for analyzing local structures (or shapes) within complex networks. Enabled by recent developments in scalable Graph Neural Networks (GNNs), this approach encodes relational information at a subgroup level (multiple connected nodes) rather than at a node level of abstraction. We posit that certain domain applications, such as anti-money laundering (AML), are inherently subgraph problems and mainstream graph techniques have been operating at a suboptimal level of abstraction. This is due in part to the scarcity of annotated datasets of real-world size and complexity, as well as the lack of software tools for managing subgraph GNN workflows at scale. To enable work in fundamental algorithms as well as domain applications in AML and beyond, we introduce Elliptic2, a large graph dataset containing 122K labeled subgraphs of Bitcoin clusters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCrime, Illicit Activities, and Governance · Blockchain Technology Applications and Security
MethodsSparse Evolutionary Training
