Progression Cognition Reinforcement Learning with Prioritized Experience for Multi-Vehicle Pursuit
Xinhang Li, Yiying Yang, Zheng Yuan, Zhe Wang, Qinwen Wang, Chen Xu,, Lei Li, Jianhua He, Lin Zhang

TL;DR
This paper introduces PEPCRL-MVP, a novel reinforcement learning approach that enhances multi-vehicle pursuit by prioritizing experiences and employing attention-based progression cognition, significantly improving success rates in urban traffic scenarios.
Contribution
It proposes a new MARL method with prioritized experience replay and progression cognition for better collaboration and efficiency in multi-vehicle pursuit tasks.
Findings
Achieves 3.95% higher pursuit efficiency than TD3-DMAP.
Increases success rate by 34.78% over MADDPG.
Demonstrates superior performance in urban multi-intersection environments.
Abstract
Multi-vehicle pursuit (MVP) such as autonomous police vehicles pursuing suspects is important but very challenging due to its mission and safety critical nature. While multi-agent reinforcement learning (MARL) algorithms have been proposed for MVP problem in structured grid-pattern roads, the existing algorithms use randomly training samples in centralized learning, which leads to homogeneous agents showing low collaboration performance. For the more challenging problem of pursuing multiple evading vehicles, these algorithms typically select a fixed target evading vehicle for pursuing vehicles without considering dynamic traffic situation, which significantly reduces pursuing success rate. To address the above problems, this paper proposes a Progression Cognition Reinforcement Learning with Prioritized Experience for MVP (PEPCRL-MVP) in urban multi-intersection dynamic traffic scenes.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning · Vehicular Ad Hoc Networks (VANETs)
MethodsDense Connections · Weight Decay · Batch Normalization · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · MADDPG · Experience Replay
