Multi-Agent Reinforcement Learning and Real-Time Decision-Making in Robotic Soccer for Virtual Environments

Aya Taourirte; Md Sohag Mia

arXiv:2512.03166·cs.RO·December 4, 2025

Multi-Agent Reinforcement Learning and Real-Time Decision-Making in Robotic Soccer for Virtual Environments

Aya Taourirte, Md Sohag Mia

PDF

Open Access

TL;DR

This paper introduces a scalable multi-agent reinforcement learning framework for robotic soccer, combining hierarchical RL, mean-field theory, and real-time decision-making to improve cooperation, strategy, and performance in dynamic environments.

Contribution

It presents a novel unified MARL framework integrating hierarchical RL and mean-field theory, optimized for real-time multi-agent decision-making in robotic soccer.

Findings

01

Proximal Policy Optimization outperforms previous methods with 4.32 goals and 82.9% ball control.

02

Hierarchical RL increases average goals to 5.26 by decomposing strategies.

03

Mean-field actor-critic achieves 5.93 goals, 89.1% ball control, and 92.3% passing accuracy.

Abstract

The deployment of multi-agent systems in dynamic, adversarial environments like robotic soccer necessitates real-time decision-making, sophisticated cooperation, and scalable algorithms to avoid the curse of dimensionality. While Reinforcement Learning (RL) offers a promising framework, existing methods often struggle with the multi-granularity of tasks (long-term strategy vs. instant actions) and the complexity of large-scale agent interactions. This paper presents a unified Multi-Agent Reinforcement Learning (MARL) framework that addresses these challenges. First, we establish a baseline using Proximal Policy Optimization (PPO) within a client-server architecture for real-time action scheduling, with PPO demonstrating superior performance (4.32 avg. goals, 82.9% ball control). Second, we introduce a Hierarchical RL (HRL) structure based on the options framework to decompose the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Adaptive Dynamic Programming Control