Velocity-History-Based Soft Actor-Critic Tackling IROS'24 Competition   "AI Olympics with RealAIGym"

Tim Lukas Faust; Habib Maraqten; Erfan Aghadavoodi; Boris Belousov and; Jan Peters

arXiv:2410.20096·cs.RO·October 29, 2024

Velocity-History-Based Soft Actor-Critic Tackling IROS'24 Competition "AI Olympics with RealAIGym"

Tim Lukas Faust, Habib Maraqten, Erfan Aghadavoodi, Boris Belousov and, Jan Peters

PDF

Open Access

TL;DR

This paper introduces a novel reinforcement learning approach based on Soft Actor-Critic with a history-encoding context vector, successfully applied to control chaotic underactuated systems in the IROS'24 competition.

Contribution

It extends SAC with a CNN-based context vector to incorporate history, improving control of chaotic systems in real-world competitions.

Findings

01

Achieved high performance scores in IROS'24 competition

02

Demonstrated robustness on Pendubot and Acrobot tasks

03

Outperformed baseline control algorithms

Abstract

The ``AI Olympics with RealAIGym'' competition challenges participants to stabilize chaotic underactuated dynamical systems with advanced control algorithms. In this paper, we present a novel solution submitted to IROS'24 competition, which builds upon Soft Actor-Critic (SAC), a popular model-free entropy-regularized Reinforcement Learning (RL) algorithm. We add a `context' vector to the state, which encodes the immediate history via a Convolutional Neural Network (CNN) to counteract the unmodeled effects on the real system. Our method achieves high performance scores and competitive robustness scores on both tracks of the competition: Pendubot and Acrobot.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Human Motion and Animation · Video Analysis and Summarization