Learning When to Act: Communication-Efficient Reinforcement Learning via Run-Time Assurance

Adam Haroon; Erick J. Rodr\'iguez-Seda; Cody Fleming; Tristan Schuler

arXiv:2605.12561·cs.LG·May 14, 2026

Learning When to Act: Communication-Efficient Reinforcement Learning via Run-Time Assurance

Adam Haroon, Erick J. Rodr\'iguez-Seda, Cody Fleming, Tristan Schuler

PDF

TL;DR

This paper introduces a run-time assurance framework for reinforcement learning that adaptively determines when an agent should act, improving safety and efficiency in control tasks like inverted pendulum and quadrotor stabilization.

Contribution

It proposes a novel approach combining Lyapunov-based safety guarantees with adaptive timing decisions, enabling safer and more communication-efficient RL policies.

Findings

01

Learned policies increase mean inter-sample interval by up to 3.51× over baselines.

02

Fixed LQR controllers are unstable, highlighting the importance of adaptive timing.

03

Lyapunov reward transferability allows environment generalization without retraining.

Abstract

Safe reinforcement learning (RL) typically asks $what$ an agent should do. We ask $when$ it needs to act, and show that a single policy can jointly learn control inputs and communication-efficient timing decisions under a pointwise Lyapunov safety shield. We focus on stabilization around a known equilibrium, where CARE-based LQR backups, Lyapunov certificates, and classical Lyapunov-STC are well defined, enabling clean comparison against analytical baselines. A run-time assurance (RTA) layer overrides the policy via a one-step-ahead Lyapunov prediction and a precomputed LQR backup, providing a strictly stronger guarantee than constrained MDP methods that enforce safety only in expectation. On an inverted pendulum, cart--pole, and planar quadrotor, the learned policy achieves $1.91 \times$ , $1.45 \times$ , and $3.51 \times$ higher mean inter-sample interval (MSI) than a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.