Robust Policy Learning for Multi-UAV Collision Avoidance with Causal   Feature Selection

Jiafan Zhuang; Gaofei Han; Zihao Xia; Boxi Wang; Wenji Li; and Dongliang Wang; Zhifeng Hao; Ruichu Cai; Zhun Fan

arXiv:2407.04056·cs.RO·July 16, 2024

Robust Policy Learning for Multi-UAV Collision Avoidance with Causal Feature Selection

Jiafan Zhuang, Gaofei Han, Zihao Xia, Boxi Wang, Wenji Li, and Dongliang Wang, Zhifeng Hao, Ruichu Cai, Zhun Fan

PDF

Open Access

TL;DR

This paper introduces a causal feature selection module for deep reinforcement learning-based UAV collision avoidance, significantly improving generalization and robustness in unseen environments.

Contribution

It proposes a novel causal feature selection method that enhances policy robustness by filtering out non-causal factors in UAV navigation.

Findings

01

Outperforms existing algorithms in unseen environments

02

Improves collision avoidance robustness

03

Reduces influence of spurious correlations

Abstract

In unseen and complex outdoor environments, collision avoidance navigation for unmanned aerial vehicle (UAV) swarms presents a challenging problem. It requires UAVs to navigate through various obstacles and complex backgrounds. Existing collision avoidance navigation methods based on deep reinforcement learning show promising performance but suffer from poor generalization abilities, resulting in performance degradation in unseen environments. To address this issue, we investigate the cause of weak generalization ability in DRL and propose a novel causal feature selection module. This module can be integrated into the policy network and effectively filters out non-causal factors in representations, thereby reducing the influence of spurious correlations between non-causal factors and action predictions. Experimental results demonstrate that our proposed method can achieve robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics