Statistical Context Detection for Deep Lifelong Reinforcement Learning
Jeffery Dick, Saptarshi Nath, Christos Peridis, Eseoghene Benjamin,, Soheil Kolouri, Andrea Soltoggio

TL;DR
This paper introduces a novel online deep reinforcement learning method that uses optimal transport metrics for statistical context detection, enabling lifelong learning without prior task labels.
Contribution
It proposes a new approach combining Wasserstein distance and statistical tests for online task detection and policy learning in reinforcement learning.
Findings
Effective context detection in lifelong RL without prior labels
Comparable or superior performance to existing algorithms on benchmarks
Provides explainable and statistically justified detection method
Abstract
Context detection involves labeling segments of an online stream of data as belonging to different tasks. Task labels are used in lifelong learning algorithms to perform consolidation or other procedures that prevent catastrophic forgetting. Inferring task labels from online experiences remains a challenging problem. Most approaches assume finite and low-dimension observation spaces or a preliminary training phase during which task labels are learned. Moreover, changes in the transition or reward functions can be detected only in combination with a policy, and therefore are more difficult to detect than changes in the input distribution. This paper presents an approach to learning both policies and labels in an online deep reinforcement learning setting. The key idea is to use distance metrics, obtained via optimal transport methods, i.e., Wasserstein distance, on suitable latent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications
