When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL

Lenart Treven; Bhavya Sukhija; Yarden As; Florian Dörfler; Andreas Krause

arXiv:2406.01163·cs.LG·November 1, 2024

TL;DR

This paper introduces a time-adaptive reinforcement learning framework, TaCoS, that optimizes both control actions and their durations, reducing interactions and improving efficiency in continuous-time systems.

Contribution

The paper formalizes the TaCoS framework for adaptive control and sensing, extending MDPs to optimize action durations, and proposes OTaCoS, a model-based algorithm with sublinear regret.
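The idea of extending an MDP so that the agent also chooses how long to hold each action can be illustrated with a small wrapper. This is a minimal sketch under stated assumptions, not the authors' implementation: the class name `TimeAdaptiveEnv`, the duration bounds `t_min` and `t_max`, the fixed `interaction_cost`, the explicit Euler integrator, and the choice to append the remaining time to the observation are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

State = Tuple[float, ...]


@dataclass
class TimeAdaptiveEnv:
    """Hypothetical wrapper: one step = hold `control` for `duration` seconds."""

    dynamics: Callable[[State, float], State]     # assumed ODE: dx/dt = f(x, u)
    reward_rate: Callable[[State, float], float]  # assumed running reward r(x, u)
    horizon: float                                # total episode length in seconds
    t_min: float = 0.05                           # assumed shortest allowed hold
    t_max: float = 1.0                            # assumed longest allowed hold
    interaction_cost: float = 0.1                 # assumed fixed cost per interaction
    dt: float = 0.01                              # Euler integration step

    def reset(self, x0: State) -> State:
        self.x = tuple(x0)
        self.t = 0.0
        self.interactions = 0
        # Observation = state plus time-to-go (an illustrative choice).
        return self.x + (self.horizon - self.t,)

    def step(self, control: float, duration: float):
        # Clip the requested duration to the allowed range and to the horizon.
        duration = min(max(duration, self.t_min), self.t_max)
        duration = min(duration, self.horizon - self.t)
        n = max(1, round(duration / self.dt))
        h = duration / n
        total_reward = 0.0
        for _ in range(n):
            # Accumulate the running reward while the control is held constant.
            total_reward += self.reward_rate(self.x, control) * h
            dx = self.dynamics(self.x, control)
            self.x = tuple(xi + h * dxi for xi, dxi in zip(self.x, dx))
        self.t += duration
        self.interactions += 1  # one measurement/switch per step
        done = self.t >= self.horizon - 1e-9
        obs = self.x + (self.horizon - self.t,)
        return obs, total_reward - self.interaction_cost, done


# Usage: a stable scalar system dx/dt = -x + u with reward rate -x^2.
env = TimeAdaptiveEnv(
    dynamics=lambda x, u: (-x[0] + u,),
    reward_rate=lambda x, u: -x[0] ** 2,
    horizon=5.0,
)
obs = env.reset((1.0,))
done = False
ret = 0.0
while not done:
    # A trivial "policy": apply zero control and hold it for one second.
    obs, r, done = env.step(control=0.0, duration=1.0)
    ret += r
print(env.interactions, obs, ret)
```

In this toy run the agent interacts with the system once per second, whereas a discrete-time agent stepping at the integrator's `dt` would interact at every integration step. Because `step` takes an ordinary (control, duration) pair and returns an observation, a reward, and a termination flag, a standard RL algorithm could in principle be trained on such a wrapper without modification.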

Findings

01

TaCoS reduces system interactions significantly compared to discrete-time RL.

02

State-of-the-art RL algorithms perform well within the TaCoS framework.

03

OTaCoS achieves sample-efficiency gains and sublinear regret in smooth systems.

Abstract

Reinforcement learning (RL) excels in optimizing policies for discrete-time Markov decision processes (MDPs). However, various systems are inherently continuous in time, making discrete-time MDPs an inexact modeling choice. In many applications, such as greenhouse control or medical treatments, each interaction (measurement or switching of action) involves manual intervention and thus is inherently costly. Therefore, we generally prefer a time-adaptive approach with fewer interactions with the system. In this work, we formalize an RL framework, Time-adaptive Control & Sensing (TaCoS), that tackles this challenge by optimizing over policies that, besides the control, also predict the duration of its application. Our formulation results in an extended MDP that any standard RL algorithm can solve. We demonstrate that state-of-the-art RL algorithms trained on TaCoS drastically reduce the interaction…

Code & Models

Repositories

lasgroup/model-based-rl (JAX, official)

Videos

When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL · SlidesLive

Taxonomy

Topics: Advanced Adaptive Filtering Techniques · Control Systems and Identification · Advanced Control Systems Optimization