Meta-Cognition. An Inverse-Inverse Reinforcement Learning Approach for Cognitive Radars
Kunal Pattanayak, Vikram Krishnamurthy, Christopher Berry

TL;DR
This paper introduces a novel inverse-inverse reinforcement learning framework for meta-cognitive radars that adapt their responses to confuse adversarial targets attempting to estimate their utility functions.
Contribution
It formulates the meta-cognition problem using spectral noise covariance analysis and embeds Riccati equations into an economic utility maximization model, pioneering the I-IRL approach.
Findings
Meta-cognitive radar can effectively increase the adversary's detection error.
The proposed approach demonstrates how to deliberately choose sub-optimal responses.
Numerical examples validate the effectiveness of the I-IRL method.
Abstract
This paper considers meta-cognitive radars in an adversarial setting. A cognitive radar optimally adapts its waveform (response) in response to maneuvers (probes) of a possibly adversarial moving target. A meta-cognitive radar is aware of the adversarial nature of the target and seeks to mitigate the adversarial target. How should the meta-cognitive radar choose its responses to sufficiently confuse the adversary trying to estimate the radar's utility function? This paper abstracts the radar's meta-cognition problem in terms of the spectra (eigenvalues) of the state and observation noise covariance matrices, and embeds the algebraic Riccati equation into an economics-based utility maximization setup. This adversarial target is an inverse reinforcement learner. By observing a noisy sequence of radar's responses (waveforms), the adversarial target uses a statistical hypothesis test to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy
MethodsAttentive Walk-Aggregating Graph Neural Network
