Reward Function Design for Crowd Simulation via Reinforcement Learning

Ariel Kwiatkowski; Vicky Kalogeiton; Julien Pettr\'e; Marie-Paule Cani

arXiv:2309.12841·cs.LG·September 25, 2023

Reward Function Design for Crowd Simulation via Reinforcement Learning

Ariel Kwiatkowski, Vicky Kalogeiton, Julien Pettr\'e, Marie-Paule Cani

PDF

TL;DR

This paper investigates how to design reward functions for reinforcement learning-based crowd simulation, providing theoretical insights and empirical evaluations to improve the realism and efficiency of virtual crowds.

Contribution

It offers a theoretical analysis of reward function validity and empirically evaluates energy-based rewards with guiding potentials for crowd simulation.

Findings

01

Direct energy minimization is effective with proper scaling.

02

Reward components significantly influence crowd behavior.

03

Guiding potentials improve energy efficiency in simulations.

Abstract

Crowd simulation is important for video-games design, since it enables to populate virtual worlds with autonomous avatars that navigate in a human-like manner. Reinforcement learning has shown great potential in simulating virtual crowds, but the design of the reward function is critical to achieving effective and efficient results. In this work, we explore the design of reward functions for reinforcement learning-based crowd simulation. We provide theoretical insights on the validity of certain reward functions according to their analytical properties, and evaluate them empirically using a range of scenarios, using the energy efficiency as the metric. Our experiments show that directly minimizing the energy usage is a viable strategy as long as it is paired with an appropriately scaled guiding potential, and enable us to study the impact of the different reward components on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.