SteadyTray: Learning Object Balancing Tasks in Humanoid Tray Transport via Residual Reinforcement Learning

Anlun Huang; Zhenyu Wu; Soofiyan Atar; Yuheng Zhi; Michael Yip

arXiv:2603.10306·cs.RO·March 12, 2026

SteadyTray: Learning Object Balancing Tasks in Humanoid Tray Transport via Residual Reinforcement Learning

Anlun Huang, Zhenyu Wu, Soofiyan Atar, Yuheng Zhi, Michael Yip

PDF

Open Access

TL;DR

This paper presents SteadyTray, a hierarchical reinforcement learning framework that effectively stabilizes payloads during humanoid tray transport, ensuring robustness and zero-shot sim-to-real transfer in dynamic environments.

Contribution

The paper introduces ReST-RL, a modular hierarchical reinforcement learning architecture that decouples locomotion from payload stabilization for humanoid robots.

Findings

01

Achieved 96.9% success in variable velocity tracking.

02

Attained 74.5% robustness against external force disturbances.

03

Demonstrated reliable zero-shot sim-to-real transfer on hardware.

Abstract

Stabilizing unsecured payloads against the inherent oscillations of dynamic bipedal locomotion remains a critical engineering bottleneck for humanoids in unstructured environments. To solve this, we introduce ReST-RL, a hierarchical reinforcement learning architecture that explicitly decouples locomotion from payload stabilization, evaluated via the SteadyTray benchmark. Rather than relying on monolithic end-to-end learning, our framework integrates a robust base locomotion policy with a dynamic residual module engineered to actively cancel gait-induced perturbations at the end-effector. This architectural separation ensures steady tray transport without degrading the underlying bipedal stability. In simulation, the residual design significantly outperforms end-to-end baselines in gait smoothness and orientation accuracy, achieving a 96.9% success rate in variable velocity tracking and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Robot Manipulation and Learning · Reinforcement Learning in Robotics