Multi-source Multi-level Multi-token Ethereum Dataset and Benchmark Platform
Haoyuan Li, Mengxiao Zhang, Maoyuan Li, Jianzheng Li, Junyi Yang,, Shuangyan Deng, Zijian Zhang, Jiamou Liu

TL;DR
The paper presents 3MEthTaskforce, a comprehensive multi-source Ethereum dataset with benchmarks for user behavior and token price prediction, supporting advanced blockchain analytics and DeFi research.
Contribution
It introduces a large-scale, multi-level, multi-token Ethereum dataset with benchmark tasks and evaluation models, addressing limitations of existing datasets.
Findings
Over 300 million transaction records included
Benchmark performance established for prediction tasks
Supports multimodal analysis for risk and market fluctuation
Abstract
This paper introduces 3MEthTaskforce (https://3meth.github.io), a multi-source, multi-level, and multi-token Ethereum dataset addressing the limitations of single-source datasets. Integrating over 300 million transaction records, 3,880 token profiles, global market indicators, and Reddit sentiment data from 2014-2024, it enables comprehensive studies on user behavior, market sentiment, and token performance. 3MEthTaskforce defines benchmarks for user behavior prediction and token price prediction tasks, using 6 dynamic graph networks and 19 time-series models to evaluate performance. Its multimodal design supports risk analysis and market fluctuation modeling, providing a valuable resource for advancing blockchain analytics and decentralized finance research.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Advanced Data Storage Technologies · Medical Imaging Techniques and Applications
