Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
Peter Sunehag, Alexander Sasha Vezhnevets, Edgar Du\'e\~nez-Guzm\'an,, Igor Mordach, Joel Z. Leibo

TL;DR
This paper introduces a reinforcement learning algorithm that uses multiple sub-policies and a novel value-decomposition method to help agents discover and converge to higher-value niches in complex environments.
Contribution
It proposes a new RL algorithm with a multi-policy architecture and a fitness-sharing inspired learning rule, improving niche exploration and avoidance of local optima.
Findings
Agents can escape local optima to find higher-value strategies.
The method outperforms baseline deep Q-learning in multi-niche environments.
Artificial chemistry platform demonstrates the approach's effectiveness.
Abstract
Many environments contain numerous available niches of variable value, each associated with a different local optimum in the space of behaviors (policy space). In such situations it is often difficult to design a learning process capable of evading distraction by poor local optima long enough to stumble upon the best available niche. In this work we propose a generic reinforcement learning (RL) algorithm that performs better than baseline deep Q-learning algorithms in such environments with multiple variably-valued niches. The algorithm we propose consists of two parts: an agent architecture and a learning rule. The agent architecture contains multiple sub-policies. The learning rule is inspired by fitness sharing in evolutionary computation and applied in reinforcement learning using Value-Decomposition-Networks in a novel manner for a single-agent's internal population. It can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Game Theory and Cooperation · Innovation Diffusion and Forecasting · Evolution and Genetic Dynamics
MethodsQ-Learning
