Techniques and Applications for Crawling, Ingesting and Analyzing Blockchain Data
Evan Brinckman, Andrey Kuehlkamp, Jarek Nabrzyski, Ian J. Taylor

TL;DR
This paper discusses methods for efficiently crawling, ingesting, and analyzing blockchain data, addressing scalability and semantic challenges, and demonstrating applications in scientific workflows and anomaly detection using machine learning.
Contribution
It introduces techniques for scalable blockchain data extraction that preserve metadata and applies unsupervised machine learning for anomaly detection and analysis.
Findings
Effective cross-referencing of smart contracts and on-chain data
Successful application of machine learning algorithms for outlier detection
Correlation of blockchain data analysis with public web sources
Abstract
As the public Ethereum network surpasses half a billion transactions and enterprise Blockchain systems becoming highly capable of meeting the demands of global deployments, production Blockchain applications are fast becoming commonplace across a diverse range of business and scientific verticals. In this paper, we reflect on work we have been conducting recently surrounding the ingestion, retrieval and analysis of Blockchain data. We describe the scaling and semantic challenges when extracting Blockchain data in a way that preserves the original metadata of each transaction by cross referencing the Smart Contract interface with the on-chain data. We then discuss a scientific use case in the area of Scientific workflows by describing how we can harvest data from tasks and dependencies in a generic way. We then discuss how crawled public blockchain data can be analyzed using two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
