Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management

Ziyang Lu; Subodh Kalia; M. Cenk Gursoy; Chilukuri K. Mohan; Pramod K. Varshney

arXiv:2506.20853·cs.LG·June 27, 2025

Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management

Ziyang Lu, Subodh Kalia, M. Cenk Gursoy, Chilukuri K. Mohan, Pramod K. Varshney

PDF

Open Access

TL;DR

This paper applies deep reinforcement learning to optimize resource management in cognitive radar systems, balancing target detection and tracking through Pareto-efficient solutions.

Contribution

It introduces a multi-objective optimization framework using DDPG and SAC algorithms for cognitive radar resource management, comparing their performance.

Findings

01

SAC outperforms DDPG in stability and sample efficiency

02

Both algorithms effectively adapt to different scenarios

03

NSGA-II provides an upper bound on the Pareto front

Abstract

The time allocation problem in multi-function cognitive radar systems focuses on the trade-off between scanning for newly emerging targets and tracking the previously detected targets. We formulate this as a multi-objective optimization problem and employ deep reinforcement learning to find Pareto-optimal solutions and compare deep deterministic policy gradient (DDPG) and soft actor-critic (SAC) algorithms. Our results demonstrate the effectiveness of both algorithms in adapting to various scenarios, with SAC showing improved stability and sample efficiency compared to DDPG. We further employ the NSGA-II algorithm to estimate an upper bound on the Pareto front of the considered problem. This work contributes to the development of more efficient and adaptive cognitive radar systems capable of balancing multiple competing objectives in dynamic environments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAir Traffic Management and Optimization · Military Defense Systems Analysis · Radar Systems and Signal Processing

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Experience Replay · Dense Connections · Deep Deterministic Policy Gradient