Statistical Context Detection for Deep Lifelong Reinforcement Learning

Jeffery Dick; Saptarshi Nath; Christos Peridis; Eseoghene Benjamin,; Soheil Kolouri; Andrea Soltoggio

arXiv:2405.19047·cs.LG·September 4, 2024

Statistical Context Detection for Deep Lifelong Reinforcement Learning

Jeffery Dick, Saptarshi Nath, Christos Peridis, Eseoghene Benjamin,, Soheil Kolouri, Andrea Soltoggio

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel online deep reinforcement learning method that uses optimal transport metrics for statistical context detection, enabling lifelong learning without prior task labels.

Contribution

It proposes a new approach combining Wasserstein distance and statistical tests for online task detection and policy learning in reinforcement learning.

Findings

01

Effective context detection in lifelong RL without prior labels

02

Comparable or superior performance to existing algorithms on benchmarks

03

Provides explainable and statistically justified detection method

Abstract

Context detection involves labeling segments of an online stream of data as belonging to different tasks. Task labels are used in lifelong learning algorithms to perform consolidation or other procedures that prevent catastrophic forgetting. Inferring task labels from online experiences remains a challenging problem. Most approaches assume finite and low-dimension observation spaces or a preliminary training phase during which task labels are learned. Moreover, changes in the transition or reward functions can be detected only in combination with a policy, and therefore are more difficult to detect than changes in the input distribution. This paper presents an approach to learning both policies and labels in an online deep reinforcement learning setting. The key idea is to use distance metrics, obtained via optimal transport methods, i.e., Wasserstein distance, on suitable latent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jupilogy/swoks
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications