Using Monte Carlo Search With Data Aggregation to Improve Robot Soccer   Policies

Francesco Riccio; Roberto Capobianco; Daniele Nardi

arXiv:1606.00285·cs.RO·June 2, 2016

Using Monte Carlo Search With Data Aggregation to Improve Robot Soccer Policies

Francesco Riccio, Roberto Capobianco, Daniele Nardi

PDF

TL;DR

This paper presents a novel Monte Carlo search with data aggregation method to enhance robot soccer policies, leading to better interception and positioning in dynamic, partially observable environments.

Contribution

Introduces MCSDA, a new approach combining Monte Carlo search and data aggregation to improve robot soccer policies through supervised learning and iterative refinement.

Findings

01

Improved ball interception rates.

02

Reduced opponents' goals.

03

Enhanced team positioning efficiency.

Abstract

RoboCup soccer competitions are considered among the most challenging multi-robot adversarial environments, due to their high dynamism and the partial observability of the environment. In this paper we introduce a method based on a combination of Monte Carlo search and data aggregation (MCSDA) to adapt discrete-action soccer policies for a defender robot to the strategy of the opponent team. By exploiting a simple representation of the domain, a supervised learning algorithm is trained over an initial collection of data consisting of several simulations of human expert policies. Monte Carlo policy rollouts are then generated and aggregated to previous data to improve the learned policy over multiple epochs and games. The proposed approach has been extensively tested both on a soccer-dedicated simulator and on real robots. Using this method, our learning robot soccer team achieves an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.