Online Reinforcement Learning-Based Dynamic Adaptive Evaluation Function   for Real-Time Strategy Tasks

Weilong Yang; Jie Zhang; Xunyun Liu; Yanqing Ye

arXiv:2501.03824·cs.AI·January 8, 2025

Online Reinforcement Learning-Based Dynamic Adaptive Evaluation Function for Real-Time Strategy Tasks

Weilong Yang, Jie Zhang, Xunyun Liu, Yanqing Ye

PDF

Open Access

TL;DR

This paper introduces an online reinforcement learning method to dynamically adjust evaluation functions in real-time strategy games, significantly improving their responsiveness and effectiveness while maintaining low computational overhead.

Contribution

It presents a novel online reinforcement learning approach with gradient descent, weight decay, and AdamW optimizer to adapt evaluation functions in real-time strategy tasks.

Findings

01

Enhanced evaluation accuracy in larger maps.

02

Significant score improvements in planning algorithms.

03

Evaluation time increase remains below 6%.

Abstract

Effective evaluation of real-time strategy tasks requires adaptive mechanisms to cope with dynamic and unpredictable environments. This study proposes a method to improve evaluation functions for real-time responsiveness to battle-field situation changes, utilizing an online reinforcement learning-based dynam-ic weight adjustment mechanism within the real-time strategy game. Building on traditional static evaluation functions, the method employs gradient descent in online reinforcement learning to update weights dynamically, incorporating weight decay techniques to ensure stability. Additionally, the AdamW optimizer is integrated to adjust the learning rate and decay rate of online reinforcement learning in real time, further reducing the dependency on manual parameter tun-ing. Round-robin competition experiments demonstrate that this method signifi-cantly enhances the application…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsWeight Decay · AdamW