Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition

Peter Sunehag; Alexander Sasha Vezhnevets; Edgar Duéñez-Guzmán; Igor Mordach; Joel Z. Leibo

arXiv:2302.01180·cs.AI·February 6, 2023

Open Access

TL;DR

This paper introduces a reinforcement learning algorithm that uses multiple sub-policies and a novel value-decomposition method to help agents discover and converge to higher-value niches in complex environments.

Contribution

It proposes a new RL algorithm that combines a multi-policy agent architecture with a fitness-sharing-inspired learning rule, improving niche exploration and helping the agent avoid poor local optima.

Findings

1. Agents can escape local optima to find higher-value strategies.
2. The method outperforms baseline deep Q-learning in multi-niche environments.
3. An artificial chemistry platform demonstrates the approach's effectiveness.

Abstract

Many environments contain numerous available niches of variable value, each associated with a different local optimum in the space of behaviors (policy space). In such situations, it is often difficult to design a learning process capable of evading distraction by poor local optima long enough to stumble upon the best available niche. In this work we propose a generic reinforcement learning (RL) algorithm that performs better than baseline deep Q-learning algorithms in such environments with multiple variably-valued niches. The algorithm we propose consists of two parts: an agent architecture and a learning rule. The agent architecture contains multiple sub-policies. The learning rule is inspired by fitness sharing in evolutionary computation and is applied in reinforcement learning by using Value-Decomposition Networks in a novel manner for a single agent's internal population. It can…
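The abstract names two ingredients without giving their details here: fitness sharing from evolutionary computation, and Value-Decomposition Networks (VDN). The sketch below illustrates only the textbook forms of those two ideas, not the paper's own learning rule. It assumes one-dimensional positions for the niche distance and small tabular Q-functions in place of deep networks; the function names `shared_fitness` and `vdn_td_update`, and all parameters, are hypothetical and chosen for illustration.

```python
def shared_fitness(raw_fitness, positions, sigma=1.0, alpha=1.0):
    """Classic fitness sharing: divide each raw fitness by its niche count.

    Assumption: individuals live on a 1-D line, so distance is |x_i - x_j|.
    Individuals crowded into the same niche see their fitness reduced.
    """
    shared = []
    for i, f_i in enumerate(raw_fitness):
        niche_count = 0.0
        for x_j in positions:
            d = abs(positions[i] - x_j)
            if d < sigma:
                # Triangular sharing kernel; equals 1 for the individual itself.
                niche_count += 1.0 - (d / sigma) ** alpha
        shared.append(f_i / niche_count)
    return shared


def vdn_td_update(q_tables, state, actions, reward, next_state,
                  lr=0.1, gamma=0.99):
    """One tabular TD step under an additive (VDN-style) decomposition.

    The joint value is the sum of the component values, and every component
    receives the same TD error. Assumption: each q_table maps a state to a
    list of action values; the paper itself uses deep Q-networks.
    """
    q_tot = sum(q[state][a] for q, a in zip(q_tables, actions))
    next_q_tot = sum(max(q[next_state]) for q in q_tables)
    td_error = reward + gamma * next_q_tot - q_tot
    for q, a in zip(q_tables, actions):
        q[state][a] += lr * td_error
    return td_error


if __name__ == "__main__":
    # Two individuals share a niche at 0.0; a third sits alone at 5.0.
    print(shared_fitness([10.0, 10.0, 10.0], [0.0, 0.0, 5.0]))

    # Two components, two states, two actions each, all values start at zero.
    tables = [{0: [0.0, 0.0], 1: [0.0, 0.0]} for _ in range(2)]
    print(vdn_td_update(tables, state=0, actions=[0, 1],
                        reward=1.0, next_state=1))
```

In the first call, the two crowded individuals end up with lower shared fitness than the isolated one, which is the pressure toward unoccupied niches that fitness sharing is meant to create. How the paper combines this idea with value decomposition across a single agent's sub-policies is not specified in the text above, so the two functions are deliberately left unconnected.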

Taxonomy

Topics: Evolutionary Game Theory and Cooperation · Innovation Diffusion and Forecasting · Evolution and Genetic Dynamics

Methods: Q-Learning