Multi-critic Learning for Whole-body End-effector Twist Tracking

Aravind Elanjimattathil Vijayan; Andrei Cramariuc; Mattia Risiglione; Christian Gehring; Marco Hutter

arXiv:2507.08656·cs.RO·September 1, 2025

Multi-critic Learning for Whole-body End-effector Twist Tracking

Aravind Elanjimattathil Vijayan, Andrei Cramariuc, Mattia Risiglione, Christian Gehring, Marco Hutter

PDF

TL;DR

This paper introduces a multi-critic reinforcement learning framework for simultaneous whole-body locomotion and end-effector control, effectively managing conflicting task goals and enabling smooth trajectory tracking in quadruped robots with arms.

Contribution

It proposes a novel multi-critic actor architecture and twist-based task formulation for velocity-aware, conflict-resolving whole-body control in robotics.

Findings

01

Effective simultaneous locomotion and manipulation demonstrated in simulation.

02

Emergent whole-body behaviors, including base assistance for arm extension.

03

Successful hardware validation on a quadruped robot.

Abstract

Learning whole-body control for locomotion and arm motions in a single policy has challenges, as the two tasks have conflicting goals. For instance, efficient locomotion typically favors a horizontal base orientation, while end-effector tracking may benefit from base tilting to extend reachability. Additionally, current Reinforcement Learning (RL) approaches using a pose-based task specification lack the ability to directly control the end-effector velocity, making smoothly executing trajectories very challenging. To address these limitations, we propose an RL-based framework that allows for dynamic, velocity-aware whole-body end-effector control. Our method introduces a multi-critic actor architecture that decouples the reward signals for locomotion and manipulation, simplifying reward tuning and allowing the policy to resolve task conflicts more effectively. Furthermore, we design a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.