SteadyTray: Learning Object Balancing Tasks in Humanoid Tray Transport via Residual Reinforcement Learning
Anlun Huang, Zhenyu Wu, Soofiyan Atar, Yuheng Zhi, Michael Yip

TL;DR
This paper presents SteadyTray, a hierarchical reinforcement learning framework that effectively stabilizes payloads during humanoid tray transport, ensuring robustness and zero-shot sim-to-real transfer in dynamic environments.
Contribution
The paper introduces ReST-RL, a modular hierarchical reinforcement learning architecture that decouples locomotion from payload stabilization for humanoid robots.
Findings
Achieved 96.9% success in variable velocity tracking.
Attained 74.5% robustness against external force disturbances.
Demonstrated reliable zero-shot sim-to-real transfer on hardware.
Abstract
Stabilizing unsecured payloads against the inherent oscillations of dynamic bipedal locomotion remains a critical engineering bottleneck for humanoids in unstructured environments. To solve this, we introduce ReST-RL, a hierarchical reinforcement learning architecture that explicitly decouples locomotion from payload stabilization, evaluated via the SteadyTray benchmark. Rather than relying on monolithic end-to-end learning, our framework integrates a robust base locomotion policy with a dynamic residual module engineered to actively cancel gait-induced perturbations at the end-effector. This architectural separation ensures steady tray transport without degrading the underlying bipedal stability. In simulation, the residual design significantly outperforms end-to-end baselines in gait smoothness and orientation accuracy, achieving a 96.9% success rate in variable velocity tracking and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Robot Manipulation and Learning · Reinforcement Learning in Robotics
