Learning to Evolve: Multi-modal Interactive Fields for Robust Humanoid Navigation in Dynamic Environments

Peifeng Jiang; Hong Liu; Jin Jin; Wenshuai Wang; Xia Li

arXiv:2605.21935·cs.RO·May 22, 2026

Learning to Evolve: Multi-modal Interactive Fields for Robust Humanoid Navigation in Dynamic Environments

Peifeng Jiang, Hong Liu, Jin Jin, Wenshuai Wang, Xia Li

PDF

1 Repo

TL;DR

This paper presents MIF, a multi-modal system for humanoid robot navigation that maintains reliable scene memory under dynamic conditions, improving safety and robustness in real-world environments.

Contribution

The introduction of Multi-modal Interactive Field (MIF), integrating confidence-aware semantic 3D Gaussian Splatting, discrepancy-triggered memory updates, and task-driven geometric reconstruction.

Findings

01

Relocation success in dynamic environments increased from 12% to 94%.

02

Semantic memory footprint reduced by 91.4%.

03

Effective separation of false positives from persistent changes.

Abstract

Safe manipulation-oriented navigation for humanoid robots requires scene memory that remains reliable under locomotion-induced perceptual distortion, environmental changes, and interaction-level geometric safety constraints. Existing semantic mapping and scene-graph systems are difficult to deploy directly in this setting because they often assume stable camera trajectories, static environments, or coarse object geometry. We introduce the Multi-modal Interactive Field (MIF), a humanoid-oriented system that integrates confidence-aware semantic 3D Gaussian Splatting, discrepancy-triggered spatial memory updates, and task-driven geometric reconstruction within a closed-loop perception-adaptation pipeline. MIF couples three fields: an uncertainty-aware 3DGS Appearance Field that suppresses gait-induced blur, a Spatial Field that maintains topological memory, and a Geometry Field that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://ziya-jiang.github.io/MIF-homepage
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.