Training and Simulation of Quadrupedal Robot in Adaptive Stair Climbing for Indoor Firefighting: An End-to-End Reinforcement Learning Approach
Baixiao Huang, Baiyu Huang, Yu Hou

TL;DR
This paper presents a two-stage end-to-end reinforcement learning framework enabling quadrupedal robots to adaptively climb various indoor stairs, improving navigation and locomotion in complex fire rescue scenarios.
Contribution
The study introduces a novel transfer learning approach from abstract to realistic stair environments and a unified navigation-locomotion learning method without hierarchical planning.
Findings
Successful transfer of stair-climbing skills across environments
Robust policy generalization to different staircase shapes
Insights into success and failure modes with increasing stair difficulty
Abstract
Quadruped robots are used for primary searches during the early stages of indoor fires. A typical primary search involves quickly and thoroughly looking for victims under hazardous conditions and monitoring flammable materials. However, situational awareness in complex indoor environments and rapid stair climbing across different staircases remain the main challenges for robot-assisted primary searches. In this project, we designed a two-stage end-to-end deep reinforcement learning (RL) approach to optimize both navigation and locomotion. In the first stage, the quadrupeds, Unitree Go2, were trained to climb stairs in Isaac Lab's pyramid-stair terrain. In the second stage, the quadrupeds were trained to climb various realistic indoor staircases in the Isaac Lab engine, with the learned policy transferred from the previous stage. These indoor staircases are straight, L-shaped, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvacuation and Crowd Dynamics · Robotic Locomotion and Control · Social Robot Interaction and HRI
