Variable Time Step Reinforcement Learning for Robotic Applications

Dong Wang; Giovanni Beltrame

arXiv:2407.00290·cs.RO·July 2, 2024

TL;DR

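This paper presents a method for Variable Time Step Reinforcement Learning (VTS-RL), an approach that adapts the control frequency in robotic tasks by letting the policy choose how long each action lasts, which reduces computational load compared with a fixed-frequency control loop.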

Contribution

The paper presents MOSEAC (Multi-Objective Soft Elastic Actor-Critic), an algorithm for VTS-RL, and validates it through theoretical analysis and experiments in simulation and on real robots, reporting faster convergence and energy savings.

Findings

01. Faster convergence compared to fixed-frequency RL.
02. Reduced energy consumption in robotic tasks.
03. Improved training outcomes with adaptive control frequencies.

Abstract

Traditional reinforcement learning (RL) generates discrete control policies, assigning one action per cycle. These policies are usually implemented in a fixed-frequency control loop. This rigidity presents challenges because the optimal control frequency is task-dependent; suboptimal frequencies increase computational demands and reduce exploration efficiency. Variable Time Step Reinforcement Learning (VTS-RL) addresses these issues with adaptive control frequencies, executing actions only when necessary, thus reducing computational load and extending the action space to include action durations. In this paper, we introduce the Multi-Objective Soft Elastic Actor-Critic (MOSEAC) method to perform VTS-RL, validating it through theoretical analysis and experimentation in simulation and on real robots. Results show faster convergence, better training results, and reduced…
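
The idea of extending the action space with an action duration can be illustrated with a minimal sketch. This is not the MOSEAC algorithm or the authors' code: the toy `PointMass` environment, the `SIM_DT` integration step, the `run` helper, and both hand-written policies are hypothetical names introduced only for illustration. Each action is a pair `(force, duration)`, and the environment holds the force for the requested duration, so a policy that picks longer durations is queried fewer times over the same horizon.

```python
from __future__ import annotations

from dataclasses import dataclass

SIM_DT = 0.01  # assumed physics integration step in seconds (hypothetical)


@dataclass
class PointMass:
    """Toy 1-D unit point mass; not the environment used in the paper."""

    position: float = 0.0
    velocity: float = 0.0

    def step(self, force: float, duration: float) -> float:
        """Hold `force` for `duration` seconds; return the time simulated."""
        n_substeps = max(1, round(duration / SIM_DT))
        for _ in range(n_substeps):
            self.velocity += force * SIM_DT  # unit mass, so a = force
            self.position += self.velocity * SIM_DT
        return n_substeps * SIM_DT


def run(policy, horizon: float) -> tuple[PointMass, int]:
    """Run `policy` for `horizon` seconds and count how often it is queried."""
    env, elapsed, decisions = PointMass(), 0.0, 0
    while elapsed < horizon - 1e-9:
        force, duration = policy(env)  # action = (control value, duration)
        elapsed += env.step(force, duration)
        decisions += 1
    return env, decisions


def fixed_policy(env: PointMass) -> tuple[float, float]:
    """Fixed-frequency baseline: always acts for 0.05 s (20 Hz)."""
    return 1.0, 0.05


def variable_policy(env: PointMass) -> tuple[float, float]:
    """Stand-in for a learned policy that chooses a longer duration."""
    return 1.0, 0.5


fixed_env, fixed_decisions = run(fixed_policy, horizon=1.0)
var_env, var_decisions = run(variable_policy, horizon=1.0)
print(fixed_decisions, var_decisions)
```

In this toy setting both policies apply the same constant force, so they reach the same final state, but the variable-duration policy is queried far less often. A learned VTS-RL policy would choose the duration from the observed state rather than returning a constant.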

Code & Models

Repositories

alpaficia/MOSEAC_Limo
PyTorch · Official

Taxonomy

Topics: Iterative Learning Control Systems