DOA: A Degeneracy Optimization Agent with Adaptive Pose Compensation Capability based on Deep Reinforcement Learning

Yanbin Li; Canran Xiao; Hongyang He; Shenghai Yuan; Zong Ke; Jiajie Yu; Zixiong Qin; Zhiguo Zhang; Wenzheng Chi; Wei Zhang

arXiv:2507.19742·cs.RO·July 29, 2025

DOA: A Degeneracy Optimization Agent with Adaptive Pose Compensation Capability based on Deep Reinforcement Learning

Yanbin Li, Canran Xiao, Hongyang He, Shenghai Yuan, Zong Ke, Jiajie Yu, Zixiong Qin, Zhiguo Zhang, Wenzheng Chi, Wei Zhang

PDF

TL;DR

This paper introduces DOA, an adaptive reinforcement learning agent designed to improve 2D-SLAM in degenerate indoor environments by dynamically optimizing sensor contributions and enhancing generalization across different settings.

Contribution

The paper presents a novel PPO-based degeneracy optimization agent with a specialized reward function and transfer learning, addressing data, quality, and annotation challenges in SLAM.

Findings

01

DOA outperforms SOTA methods in degeneracy detection and optimization.

02

The agent effectively adapts sensor contributions based on degeneracy factors.

03

Transfer learning enhances the generalization of the agent across environments.

Abstract

Particle filter-based 2D-SLAM is widely used in indoor localization tasks due to its efficiency. However, indoor environments such as long straight corridors can cause severe degeneracy problems in SLAM. In this paper, we use Proximal Policy Optimization (PPO) to train an adaptive degeneracy optimization agent (DOA) to address degeneracy problem. We propose a systematic methodology to address three critical challenges in traditional supervised learning frameworks: (1) data acquisition bottlenecks in degenerate dataset, (2) inherent quality deterioration of training samples, and (3) ambiguity in annotation protocol design. We design a specialized reward function to guide the agent in developing perception capabilities for degenerate environments. Using the output degeneracy factor as a reference weight, the agent can dynamically adjust the contribution of different sensors to pose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.