SC2EGSet: StarCraft II Esport Replay and Game-state Dataset
Andrzej Bia{\l}ecki, Natalia Jakubowska, Pawe{\l} Dobrowolski, Piotr, Bia{\l}ecki, Leszek Krupi\'nski, Andrzej Szczap, Robert Bia{\l}ecki, Jan, Gajewski

TL;DR
This paper introduces SC2EGSet, a comprehensive and large-scale dataset of StarCraft II esports replays and game states, along with open-source tools, to facilitate scientific research in AI, ML, psychology, and HCI.
Contribution
The authors provide the largest publicly available StarCraft II esports dataset with raw and processed data, plus open-source tools for data extraction and modeling, enabling broader scientific analysis.
Findings
Largest publicly available StarCraft II esports dataset
Processed 55 tournament replay packs with 17,930 files
Tools for data loading and modeling are open-sourced
Abstract
As a relatively new form of sport, esports offers unparalleled data availability. Despite the vast amounts of data that are generated by game engines, it can be challenging to extract them and verify their integrity for the purposes of practical and scientific use. Our work aims to open esports to a broader scientific community by supplying raw and pre-processed files from StarCraft II esports tournaments. These files can be used in statistical and machine learning modeling tasks and related to various laboratory-based measurements (e.g., behavioral tests, brain imaging). We have gathered publicly available game-engine generated "replays" of tournament matches and performed data extraction and cleanup using a low-level application programming interface (API) parser library. Additionally, we open-sourced and published all the custom tools that were developed in the process of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Games and Media · Gambling Behavior and Treatments · Artificial Intelligence in Games
