Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems

Egor E. Nuzhin; Nikolai V. Brilliantov

arXiv:2411.14117·cs.LG·February 28, 2025

TL;DR

This paper introduces Umbrella Reinforcement Learning, a method that combines umbrella sampling with neural-network policy gradients to solve hard nonlinear RL problems (sparse reward, state traps, no terminal states) with better computational efficiency than existing algorithms.

Contribution

The paper integrates umbrella sampling into RL through a neural-network, policy-gradient implementation, improving computational efficiency and exploration.

Findings

1. Outperforms state-of-the-art algorithms in hard RL tasks
2. Efficiently handles sparse rewards and state traps
3. Utilizes ensemble agents with modified rewards for better exploration

Abstract

We report a novel, computationally efficient approach for solving hard nonlinear problems of reinforcement learning (RL). Here we combine umbrella sampling, from computational physics/chemistry, with optimal control methods. The approach is realized on the basis of neural networks, with the use of policy gradient. It outperforms, by computational efficiency and implementation universality, all available state-of-the-art algorithms, in application to hard RL problems with sparse reward, state traps and lack of terminal states. The proposed approach uses an ensemble of simultaneously acting agents, with a modified reward which includes the ensemble entropy, yielding an optimal exploration-exploitation balance.
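
The abstract's idea of a reward modified by the ensemble entropy can be illustrated with a small sketch. This is not the authors' implementation: the discretized states, the histogram density estimate, the weight `alpha`, and the function names `ensemble_entropy` and `modified_rewards` are all assumptions made for illustration. Each agent receives a bonus of `-alpha * log p(s)`, where `p(s)` is the fraction of the ensemble currently in its state, so that the bonus averaged over the ensemble equals `alpha` times the ensemble entropy.

```python
import math
from collections import Counter


def ensemble_entropy(states):
    """Shannon entropy (in nats) of the empirical distribution of agent states."""
    counts = Counter(states)
    n = len(states)
    return -sum((c / n) * math.log(c / n) for c in counts.values())


def modified_rewards(states, rewards, alpha=0.1):
    """Add a per-agent entropy bonus -alpha * log p(s) to each raw reward.

    p(s) is the fraction of agents occupying state s (hypothetical histogram
    estimate). Averaged over the ensemble, the bonus is alpha * entropy.
    """
    counts = Counter(states)
    n = len(states)
    return [r - alpha * math.log(counts[s] / n) for s, r in zip(states, rewards)]


# Four agents, three crowded into state 0 and one exploring state 1.
states = [0, 0, 0, 1]
rewards = [0.0, 0.0, 0.0, 0.0]  # sparse reward: nothing observed yet
shaped = modified_rewards(states, rewards, alpha=1.0)
```

With zero raw reward everywhere, the lone agent in state 1 receives a larger shaped reward than the three agents crowded into state 0, which is the kind of pressure toward exploration that an entropy term provides when the task reward is sparse.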

Code & Models

Repositories

enuzhin/ur (PyTorch, official)

Taxonomy

Methods: Umbrella Reinforcement Learning