Progression Cognition Reinforcement Learning with Prioritized Experience   for Multi-Vehicle Pursuit

Xinhang Li; Yiying Yang; Zheng Yuan; Zhe Wang; Qinwen Wang; Chen Xu,; Lei Li; Jianhua He; Lin Zhang

arXiv:2306.05016·cs.AI·June 9, 2023·2 cites

Progression Cognition Reinforcement Learning with Prioritized Experience for Multi-Vehicle Pursuit

Xinhang Li, Yiying Yang, Zheng Yuan, Zhe Wang, Qinwen Wang, Chen Xu,, Lei Li, Jianhua He, Lin Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces PEPCRL-MVP, a novel reinforcement learning approach that enhances multi-vehicle pursuit by prioritizing experiences and employing attention-based progression cognition, significantly improving success rates in urban traffic scenarios.

Contribution

It proposes a new MARL method with prioritized experience replay and progression cognition for better collaboration and efficiency in multi-vehicle pursuit tasks.

Findings

01

Achieves 3.95% higher pursuit efficiency than TD3-DMAP.

02

Increases success rate by 34.78% over MADDPG.

03

Demonstrates superior performance in urban multi-intersection environments.

Abstract

Multi-vehicle pursuit (MVP) such as autonomous police vehicles pursuing suspects is important but very challenging due to its mission and safety critical nature. While multi-agent reinforcement learning (MARL) algorithms have been proposed for MVP problem in structured grid-pattern roads, the existing algorithms use randomly training samples in centralized learning, which leads to homogeneous agents showing low collaboration performance. For the more challenging problem of pursuing multiple evading vehicles, these algorithms typically select a fixed target evading vehicle for pursuing vehicles without considering dynamic traffic situation, which significantly reduces pursuing success rate. To address the above problems, this paper proposes a Progression Cognition Reinforcement Learning with Prioritized Experience for MVP (PEPCRL-MVP) in urban multi-intersection dynamic traffic scenes.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bupt-antlab/pepcrl-mvp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning · Vehicular Ad Hoc Networks (VANETs)

MethodsDense Connections · Weight Decay · Batch Normalization · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · MADDPG · Experience Replay