Data Augmentation through Expert-guided Symmetry Detection to Improve Performance in Offline Reinforcement Learning
Giorgio Angelotti, Nicolas Drougard, Caroline P. C. Chanel

TL;DR
This paper introduces an expert-guided symmetry detection method for data augmentation in offline reinforcement learning, improving model accuracy and policy performance in both deterministic and non-deterministic environments.
Contribution
It extends symmetry detection techniques to non-deterministic MDPs and proposes a statistical threshold for categorical environments, enhancing offline RL performance.
Findings
Data augmentation reduces distributional shift in learned models
Performance improves when applying policies trained on augmented data
Method effective in both deterministic and non-deterministic MDPs
Abstract
Offline estimation of the dynamical model of a Markov Decision Process (MDP) is a non-trivial task that greatly depends on the data available in the learning phase. Sometimes the dynamics of the model is invariant with respect to some transformations of the current state and action. Recent works showed that an expert-guided pipeline relying on Density Estimation methods as Deep Neural Network based Normalizing Flows effectively detects this structure in deterministic environments, both categorical and continuous-valued. The acquired knowledge can be exploited to augment the original data set, leading eventually to a reduction in the distributional shift between the true and the learned model. Such data augmentation technique can be exploited as a preliminary process to be executed before adopting an Offline Reinforcement Learning architecture, increasing its performance. In this work we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Data Stream Mining Techniques · Reinforcement Learning in Robotics
MethodsNetwork On Network · Normalizing Flows
