Online Reinforcement Learning-Based Dynamic Adaptive Evaluation Function for Real-Time Strategy Tasks
Weilong Yang, Jie Zhang, Xunyun Liu, Yanqing Ye

TL;DR
This paper introduces an online reinforcement learning method to dynamically adjust evaluation functions in real-time strategy games, significantly improving their responsiveness and effectiveness while maintaining low computational overhead.
Contribution
It presents a novel online reinforcement learning approach with gradient descent, weight decay, and AdamW optimizer to adapt evaluation functions in real-time strategy tasks.
Findings
Enhanced evaluation accuracy in larger maps.
Significant score improvements in planning algorithms.
Evaluation time increase remains below 6%.
Abstract
Effective evaluation of real-time strategy tasks requires adaptive mechanisms to cope with dynamic and unpredictable environments. This study proposes a method to improve evaluation functions for real-time responsiveness to battle-field situation changes, utilizing an online reinforcement learning-based dynam-ic weight adjustment mechanism within the real-time strategy game. Building on traditional static evaluation functions, the method employs gradient descent in online reinforcement learning to update weights dynamically, incorporating weight decay techniques to ensure stability. Additionally, the AdamW optimizer is integrated to adjust the learning rate and decay rate of online reinforcement learning in real time, further reducing the dependency on manual parameter tun-ing. Round-robin competition experiments demonstrate that this method signifi-cantly enhances the application…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsWeight Decay · AdamW
