Adaptive Resource Management in Cognitive Radar via Deep Deterministic Policy Gradient
Ziyang Lu, M. Cenk Gursoy, Chilukuri K. Mohan, Pramod K. Varshney

TL;DR
This paper introduces a deep reinforcement learning approach to optimize resource allocation in cognitive radar systems, enabling autonomous and efficient multi-target tracking under time constraints.
Contribution
It develops a constrained deep reinforcement learning algorithm using DDPG for adaptive resource management in cognitive radar, addressing joint optimization of scanning and tracking.
Findings
Radar autonomously allocates time to maximize tracking performance.
The proposed method respects time budget constraints.
Numerical results demonstrate improved resource management efficiency.
Abstract
In this paper, scanning for target detection, and multi-target tracking in a cognitive radar system are considered, and adaptive radar resource management is investigated. In particular, time management for radar scanning and tracking of multiple maneuvering targets subject to budget constraints is studied with the goal to jointly maximize the tracking and scanning performances of a cognitive radar. We tackle the constrained optimization problem of allocating the dwell time to track individual targets by employing a deep deterministic policy gradient (DDPG) based reinforcement learning approach. We propose a constrained deep reinforcement learning (CDRL) algorithm that updates the DDPG neural networks and dual variables simultaneously. Numerical results show that the radar can autonomously allocate time appropriately so as to maximize the reward function without exceeding the time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
