Assistax: A Hardware-Accelerated Reinforcement Learning Benchmark for Assistive Robotics

Leonard Hinckeldey; Elliot Fosong; Elle Miller; Rimvydas Rubavicius; Trevor McInroe; Patricia Wollstadt; Christiane B. Wiebel-Herboth; Subramanian Ramamoorthy; Stefano V. Albrecht

arXiv:2507.21638·cs.AI·July 30, 2025

TL;DR

Assistax introduces a fast, open-source reinforcement learning benchmark tailored for assistive robotics, utilizing hardware acceleration and multi-agent RL to evaluate robot-human interaction capabilities.

Contribution

The paper presents a hardware-accelerated benchmark designed specifically for assistive robotics, targeting the challenges that arise in embodied interaction.

Findings

01

In open-loop wall-clock time, Assistax runs up to 370x faster than CPU-based alternatives when training runs are vectorised.

02

Provides reliable baselines for RL and MARL algorithms in assistive robotics.

03

Establishes a practical benchmark for advancing RL in real-world assistive scenarios.

Abstract

The development of reinforcement learning (RL) algorithms has been largely driven by ambitious challenge tasks and benchmarks. Games have dominated RL benchmarks because they present relevant challenges, are inexpensive to run and easy to understand. While games such as Go and Atari have led to many breakthroughs, they often do not directly translate to real-world embodied applications. In recognising the need to diversify RL benchmarks and addressing complexities that arise in embodied interaction scenarios, we introduce Assistax: an open-source benchmark designed to address challenges arising in assistive robotics tasks. Assistax uses JAX's hardware acceleration for significant speed-ups for learning in physics-based simulations. In terms of open-loop wall-clock time, Assistax runs up to $370 \times$ faster when vectorising training runs compared to CPU-based alternatives. Assistax…
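To make "vectorising" and "open-loop" concrete, here is a minimal JAX sketch. It is not the Assistax API: the point-mass dynamics in `step`, the function names, and the batch size of 1024 are all assumptions made for illustration. It only shows the general pattern of rolling out many independent simulations with random actions (no policy in the loop) under `jax.vmap` and `jax.jit`.

```python
import jax
import jax.numpy as jnp

# Hypothetical toy dynamics: a 1-D point mass. This stands in for a physics
# simulator step and is NOT taken from Assistax.
def step(state, action, dt=0.05):
    pos, vel = state
    vel = vel + dt * action
    pos = pos + dt * vel
    return (pos, vel)

def open_loop_rollout(key, num_steps=100):
    """Roll out one environment with random actions (no policy in the loop)."""
    actions = jax.random.uniform(key, (num_steps,), minval=-1.0, maxval=1.0)
    init = (jnp.zeros(()), jnp.zeros(()))

    def body(state, action):
        new_state = step(state, action)
        return new_state, new_state[0]  # carry the state, record the position

    _, positions = jax.lax.scan(body, init, actions)
    return positions

# Vectorise over independent rollouts, then compile the batched function once.
batched_rollout = jax.jit(jax.vmap(open_loop_rollout))

keys = jax.random.split(jax.random.PRNGKey(0), 1024)  # one key per rollout
positions = batched_rollout(keys)  # shape: (1024, 100)
```

Each of the 1024 rollouts runs inside a single compiled call, which is the kind of batching that lets accelerator-based simulation outpace a loop over CPU environments. The actual speed-up depends on the simulator and hardware; the 370x figure above is the paper's measurement, not something this sketch reproduces.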
