Seq-DeepIPC: Sequential Sensing for End-to-End Control in Legged Robot Navigation
Oskar Natan, Jun Miura

TL;DR
Seq-DeepIPC is a novel end-to-end perception-to-control model for legged robot navigation that integrates multi-modal sensing, temporal fusion, and efficient processing, enabling robust navigation in diverse real-world environments.
Contribution
The paper introduces Seq-DeepIPC, a new sequential perception-to-control model that combines multi-modal perception with temporal fusion for improved legged robot navigation.
Findings
Sequential inputs enhance perception and control accuracy.
EfficientNet-B0 enables real-time deployment on edge devices.
GNSS-based heading estimation is effective in open areas.
Abstract
We present Seq-DeepIPC, a sequential end-to-end perception-to-control model for legged robot navigation in realworld environments. Seq-DeepIPC advances intelligent sensing for autonomous legged navigation by tightly integrating multi-modal perception (RGB-D + GNSS) with temporal fusion and control. The model jointly predicts semantic segmentation and depth estimation, giving richer spatial features for planning and control. For efficient deployment on edge devices, we use EfficientNet-B0 as the encoder, reducing computation while maintaining accuracy. Heading estimation is simplified by removing the noisy IMU and instead computing the bearing angle directly from consecutive GNSS positions. We collected a larger and more diverse dataset that includes both road and grass terrains, and validated Seq-DeepIPC on a robot dog. Comparative and ablation studies show that sequential inputs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
