Risk Perspective Exploration in Distributional Reinforcement Learning

Jihwan Oh; Joonkee Kim; Se-Young Yun

arXiv:2206.14170·cs.LG·July 4, 2022·1 cites

Risk Perspective Exploration in Distributional Reinforcement Learning

Jihwan Oh, Joonkee Kim, Se-Young Yun

PDF

Open Access

TL;DR

This paper introduces risk scheduling methods in distributional reinforcement learning to improve exploration by leveraging risk levels and optimistic behaviors, demonstrating enhanced performance in multi-agent scenarios.

Contribution

It proposes novel risk scheduling approaches for exploration in distributional RL and shows their effectiveness in multi-agent environments.

Findings

01

Risk scheduling improves exploration efficiency.

02

Enhanced performance of DMIX with risk-based exploration.

03

Effective in multi-agent control tasks.

Abstract

Distributional reinforcement learning demonstrates state-of-the-art performance in continuous and discrete control settings with the features of variance and risk, which can be used to explore. However, the exploration method employing the risk property is hard to find, although numerous exploration methods in Distributional RL employ the variance of return distribution per action. In this paper, we present risk scheduling approaches that explore risk levels and optimistic behaviors from a risk perspective. We demonstrate the performance enhancement of the DMIX algorithm using risk scheduling in a multi-agent setting with comprehensive experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Software Engineering Methodologies · Advanced Multi-Objective Optimization Algorithms