player2vec: A Language Modeling Approach to Understand Player Behavior in Games
Tianze Wang, Maryam Honari-Jahromi, Styliani Katsarou, Olga Mikheeva,, Theodoros Panagiotakopoulos, Sahar Asadi, Oleg Smirnov

TL;DR
This paper introduces player2vec, a novel Transformer-based method that models player behavior in games by treating in-game events as language tokens, enabling self-supervised learning of player representations.
Contribution
It extends language modeling techniques to gaming data, providing a new way to understand and analyze player behavior without ground-truth labels.
Findings
Effective in modeling behavior event distributions
Learned embeddings reveal meaningful behavior patterns
Supports downstream analysis and insights
Abstract
Methods for learning latent user representations from historical behavior logs have gained traction for recommendation tasks in e-commerce, content streaming, and other settings. However, this area still remains relatively underexplored in video and mobile gaming contexts. In this work, we present a novel method for overcoming this limitation by extending a long-range Transformer model from the natural language processing domain to player behavior data. We discuss specifics of behavior tracking in games and propose preprocessing and tokenization approaches by viewing in-game events in an analogous way to words in sentences, thus enabling learning player representations in a self-supervised manner in the absence of ground-truth annotations. We experimentally demonstrate the efficacy of the proposed approach in fitting the distribution of behavior events by evaluating intrinsic language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Topic Modeling
MethodsAttention Is All You Need · Softmax · Linear Layer · Dense Connections · Label Smoothing · Position-Wise Feed-Forward Layer · Residual Connection · Dropout · Layer Normalization · Multi-Head Attention
