Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments

Xiaoyi He; Danggui Chen; Zhenshuo Zhang; Zimeng Bai

arXiv:2510.26646·cs.RO·October 31, 2025

Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments

Xiaoyi He, Danggui Chen, Zhenshuo Zhang, Zimeng Bai

PDF

TL;DR

This paper introduces a hierarchical reinforcement learning framework combining DQN and TD3 for autonomous navigation, demonstrating improved success, safety, and efficiency in dynamic environments.

Contribution

It presents a novel hybrid DQN-TD3 architecture with a safety gate and reward shaping for robust navigation in dynamic, partially observable settings.

Findings

01

Higher success rate compared to baseline algorithms

02

Enhanced generalization to unseen obstacle scenarios

03

Reduced abrupt control changes and improved safety

Abstract

This paper presents a hierarchical path-planning and control framework that combines a high-level Deep Q-Network (DQN) for discrete sub-goal selection with a low-level Twin Delayed Deep Deterministic Policy Gradient (TD3) controller for continuous actuation. The high-level module selects behaviors and sub-goals; the low-level module executes smooth velocity commands. We design a practical reward shaping scheme (direction, distance, obstacle avoidance, action smoothness, collision penalty, time penalty, and progress), together with a LiDAR-based safety gate that prevents unsafe motions. The system is implemented in ROS + Gazebo (TurtleBot3) and evaluated with PathBench metrics, including success rate, collision rate, path efficiency, and re-planning efficiency, in dynamic and partially observable environments. Experiments show improved success rate and sample efficiency over…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.