Three-dimensional attention Transformer for state evaluation in real-time strategy games
Yanqing Ye, Weilong Yang, Kai Qiu, Jie Zhang

TL;DR
This paper introduces a tri-dimensional Transformer architecture for real-time strategy game situation assessment, effectively modeling spatial, temporal, and feature information to improve accuracy and efficiency.
Contribution
It proposes a novel tri-dimensional Space-Time-Feature Transformer that outperforms existing models in RTS game state evaluation with fewer parameters.
Findings
Achieves 58.7% accuracy early game, surpassing Timesformer.
Reaches 97.6% mid-game accuracy with low variation.
Uses fewer parameters than baseline models.
Abstract
Situation assessment in Real-Time Strategy (RTS) games is crucial for understanding decision-making in complex adversarial environments. However, existing methods remain limited in processing multi-dimensional feature information and temporal dependencies. Here we propose a tri-dimensional Space-Time-Feature Transformer (TSTF Transformer) architecture, which efficiently models battlefield situations through three independent but cascaded modules: spatial attention, temporal attention, and feature attention. On a dataset comprising 3,150 adversarial experiments, the 8-layer TSTF Transformer demonstrates superior performance: achieving 58.7% accuracy in the early game (~4% progress), significantly outperforming the conventional Timesformer's 41.8%; reaching 97.6% accuracy in the mid-game (~40% progress) while maintaining low performance variation (standard deviation 0.114). Meanwhile,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Intelligent Tutoring Systems and Adaptive Learning
MethodsAttention Is All You Need · Absolute Position Encodings · Softmax · Linear Layer · Adam · Residual Connection · Dropout · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing
