A Multimodal Architecture for Endpoint Position Prediction in Team-based Multiplayer Games

Jonas Peche; Aliaksei Tsishurou; Alexander Zap; Guenter Wallner

arXiv:2507.20670 · cs.CV · July 29, 2025

TL;DR

This paper introduces a multimodal, U-Net-based architecture that predicts future player locations in team-based multiplayer games. A multimodal feature encoder with multi-head attention conditions the U-Net on image, numerical, and categorical game-state data, and the predicted endpoint heatmaps can support downstream game analytics.

Contribution

The paper presents a novel multimodal architecture that effectively combines heterogeneous game data for accurate endpoint position prediction in complex multiplayer environments.

Findings

01. Achieved accurate future player location predictions.

02. Effectively integrated image, numerical, and categorical data.

03. Enabled downstream tasks like bot behavior modeling and anomaly detection.

Abstract

Understanding and predicting player movement in multiplayer games is crucial for achieving use cases such as player-mimicking bot navigation, preemptive bot control, strategy recommendation, and real-time player behavior analytics. However, the complex environments allow for a high degree of navigational freedom, and the interactions and team-play between players require models that make effective use of the available heterogeneous input data. This paper presents a multimodal architecture for predicting future player locations on a dynamic time horizon, using a U-Net-based approach for calculating endpoint location probability heatmaps, conditioned using a multimodal feature encoder. The application of a multi-head attention mechanism for different groups of features allows for communication between agents. In doing so, the architecture makes efficient use of the multimodal game state…
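To make the idea of an "endpoint location probability heatmap" concrete, the following is a minimal sketch of how a grid of raw scores could be turned into a probability distribution over map cells and then read out as a predicted endpoint. It is not the authors' implementation: the abstract does not say how the heatmap is normalized or decoded, so the softmax normalization, the function names, and the 3x3 example grid are all assumptions made for illustration.

```python
import math

def heatmap_to_probabilities(logits):
    """Softmax over every cell of a 2D score grid (assumed normalization)."""
    peak = max(v for row in logits for v in row)  # subtract max for stability
    exps = [[math.exp(v - peak) for v in row] for row in logits]
    total = sum(sum(row) for row in exps)
    return [[e / total for e in row] for row in exps]

def most_likely_endpoint(probs):
    """Return the (row, col) index of the highest-probability cell."""
    cells = ((r, c) for r, row in enumerate(probs) for c in range(len(row)))
    return max(cells, key=lambda rc: probs[rc[0]][rc[1]])

def expected_endpoint(probs):
    """Return the probability-weighted mean (row, col) position."""
    exp_row = sum(r * p for r, row in enumerate(probs) for p in row)
    exp_col = sum(c * p for row in probs for c, p in enumerate(row))
    return exp_row, exp_col

# Hypothetical 3x3 grid of raw scores, standing in for a network output.
logits = [
    [0.0, 1.0, 0.0],
    [1.0, 3.0, 1.0],
    [0.0, 1.0, 0.0],
]
probs = heatmap_to_probabilities(logits)
endpoint = most_likely_endpoint(probs)
mean_position = expected_endpoint(probs)
```

Reading out the highest-probability cell gives a single best guess, while the weighted mean summarizes the whole distribution. The two can differ substantially when the heatmap has several separated peaks, for example when a player could plausibly take either of two routes, which is one reason a heatmap output is more informative than a single regressed coordinate.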
