Data Augmentation through Expert-guided Symmetry Detection to Improve   Performance in Offline Reinforcement Learning

Giorgio Angelotti; Nicolas Drougard; Caroline P. C. Chanel

arXiv:2112.09943·cs.LG·April 13, 2023

Data Augmentation through Expert-guided Symmetry Detection to Improve Performance in Offline Reinforcement Learning

Giorgio Angelotti, Nicolas Drougard, Caroline P. C. Chanel

PDF

Open Access 1 Repo

TL;DR

This paper introduces an expert-guided symmetry detection method for data augmentation in offline reinforcement learning, improving model accuracy and policy performance in both deterministic and non-deterministic environments.

Contribution

It extends symmetry detection techniques to non-deterministic MDPs and proposes a statistical threshold for categorical environments, enhancing offline RL performance.

Findings

01

Data augmentation reduces distributional shift in learned models

02

Performance improves when applying policies trained on augmented data

03

Method effective in both deterministic and non-deterministic MDPs

Abstract

Offline estimation of the dynamical model of a Markov Decision Process (MDP) is a non-trivial task that greatly depends on the data available in the learning phase. Sometimes the dynamics of the model is invariant with respect to some transformations of the current state and action. Recent works showed that an expert-guided pipeline relying on Density Estimation methods as Deep Neural Network based Normalizing Flows effectively detects this structure in deterministic environments, both categorical and continuous-valued. The acquired knowledge can be exploited to augment the original data set, leading eventually to a reduction in the distributional shift between the true and the learned model. Such data augmentation technique can be exploited as a preliminary process to be executed before adopting an Offline Reinforcement Learning architecture, increasing its performance. In this work we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

giorgioangel/dsym
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Data Stream Mining Techniques · Reinforcement Learning in Robotics

MethodsNetwork On Network · Normalizing Flows