Geometric Active Exploration in Markov Decision Processes: the Benefit   of Abstraction

Riccardo De Santi; Federico Arangath Joseph; Noah Liniger; Mirco; Mutti; Andreas Krause

arXiv:2407.13364·cs.LG·July 19, 2024

Geometric Active Exploration in Markov Decision Processes: the Benefit of Abstraction

Riccardo De Santi, Federico Arangath Joseph, Noah Liniger, Mirco, Mutti, Andreas Krause

PDF

Open Access

TL;DR

This paper introduces Geometric Active Exploration (GAE), a novel algorithm that leverages geometric abstractions in Markov Decision Processes to improve the efficiency of experimental design in scientific discovery.

Contribution

It extends MDP homomorphisms to Convex RL and provides the first formal analysis of how abstraction benefits sample efficiency in active exploration.

Findings

01

GAE outperforms baseline methods in efficiency.

02

Abstraction via homomorphisms improves sample complexity.

03

Theoretical analysis confirms the benefits of geometric abstraction.

Abstract

How can a scientist use a Reinforcement Learning (RL) algorithm to design experiments over a dynamical system's state space? In the case of finite and Markovian systems, an area called Active Exploration (AE) relaxes the optimization problem of experiments design into Convex RL, a generalization of RL admitting a wider notion of reward. Unfortunately, this framework is currently not scalable and the potential of AE is hindered by the vastness of experiment spaces typical of scientific discovery applications. However, these spaces are often endowed with natural geometries, e.g., permutation invariance in molecular design, that an agent could leverage to improve the statistical and computational efficiency of AE. To achieve this, we bridge AE and MDP homomorphisms, which offer a way to exploit known geometric structures via abstraction. Towards this goal, we make two fundamental…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Robotic Path Planning Algorithms · Data Management and Algorithms

MethodsAutoencoders