Discovering Behavioral Modes in Deep Reinforcement Learning Policies   Using Trajectory Clustering in Latent Space

Sindre Benjamin Remman; Anastasios M. Lekkas

arXiv:2402.12939·cs.LG·February 21, 2024·1 cites

Discovering Behavioral Modes in Deep Reinforcement Learning Policies Using Trajectory Clustering in Latent Space

Sindre Benjamin Remman, Anastasios M. Lekkas

PDF

Open Access

TL;DR

This paper presents a novel method combining dimensionality reduction and trajectory clustering in latent space to analyze and understand the behavior modes of deep reinforcement learning policies, aiding targeted improvements.

Contribution

It introduces a new approach using PaCMAP and TRACLUS for analyzing DRL policies' latent space, revealing behavior patterns and suboptimal actions.

Findings

01

Identified diverse behavior patterns in DRL policies

02

Enhanced policy performance through targeted analysis

03

Demonstrated effectiveness on Mountain Car task

Abstract

Understanding the behavior of deep reinforcement learning (DRL) agents is crucial for improving their performance and reliability. However, the complexity of their policies often makes them challenging to understand. In this paper, we introduce a new approach for investigating the behavior modes of DRL policies, which involves utilizing dimensionality reduction and trajectory clustering in the latent space of neural networks. Specifically, we use Pairwise Controlled Manifold Approximation Projection (PaCMAP) for dimensionality reduction and TRACLUS for trajectory clustering to analyze the latent space of a DRL policy trained on the Mountain Car control task. Our methodology helps identify diverse behavior patterns and suboptimal choices by the policy, thus allowing for targeted improvements. We demonstrate how our approach, combined with domain knowledge, can enhance a policy's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications