Training and Simulation of Quadrupedal Robot in Adaptive Stair Climbing for Indoor Firefighting: An End-to-End Reinforcement Learning Approach

Baixiao Huang; Baiyu Huang; Yu Hou

arXiv:2602.03087·cs.RO·February 4, 2026

Training and Simulation of Quadrupedal Robot in Adaptive Stair Climbing for Indoor Firefighting: An End-to-End Reinforcement Learning Approach

Baixiao Huang, Baiyu Huang, Yu Hou

PDF

Open Access

TL;DR

This paper presents a two-stage end-to-end reinforcement learning framework enabling quadrupedal robots to adaptively climb various indoor stairs, improving navigation and locomotion in complex fire rescue scenarios.

Contribution

The study introduces a novel transfer learning approach from abstract to realistic stair environments and a unified navigation-locomotion learning method without hierarchical planning.

Findings

01

Successful transfer of stair-climbing skills across environments

02

Robust policy generalization to different staircase shapes

03

Insights into success and failure modes with increasing stair difficulty

Abstract

Quadruped robots are used for primary searches during the early stages of indoor fires. A typical primary search involves quickly and thoroughly looking for victims under hazardous conditions and monitoring flammable materials. However, situational awareness in complex indoor environments and rapid stair climbing across different staircases remain the main challenges for robot-assisted primary searches. In this project, we designed a two-stage end-to-end deep reinforcement learning (RL) approach to optimize both navigation and locomotion. In the first stage, the quadrupeds, Unitree Go2, were trained to climb stairs in Isaac Lab's pyramid-stair terrain. In the second stage, the quadrupeds were trained to climb various realistic indoor staircases in the Isaac Lab engine, with the learned policy transferred from the previous stage. These indoor staircases are straight, L-shaped, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvacuation and Crowd Dynamics · Robotic Locomotion and Control · Social Robot Interaction and HRI