# Robust Learning-Based Detection with Cost Control and Byzantine Mitigation

**Authors:** Chen Zhong, M. Cenk Gursoy, Senem Velipasalar

PMC · DOI: 10.3390/s26010005 · Sensors (Basel, Switzerland) · 2025-12-19

## TL;DR

This paper introduces a deep reinforcement learning framework for reliable system state detection while managing costs and mitigating Byzantine attacks.

## Contribution

A novel SAC-DRL framework with a GAN-based Byzantine detector for robust state estimation under noisy and adversarial conditions.

## Key findings

- Soft actor-critic algorithms achieve stable performance in imperfect environments with jamming and high sensing costs.
- The GAN-based framework effectively identifies Byzantine sensors, improving system reliability and detection accuracy.
- SAC-DRL outperforms conventional actor-critic and fixed scheduling methods in detection accuracy and cost efficiency.

## Abstract

To address the state estimation and detection problem in the presence of noisy sensor observations, probing costs, and communication noise, we in this paper propose a soft actor-critic (SAC) deep reinforcement learning (DRL) framework for dynamically scheduling sensors and sequentially probing the state of a stochastic system. Moreover, considering Byzantine attacks, we design a generative adversarial network (GAN)-based framework to identify the Byzantine sensors. The GAN-based Byzantine detector and SAC-DRL-based agent are developed to operate in coordination to detect the state of the system reliably and fast while incurring small sensing cost. To evaluate the proposed framework, we measure the performance in terms of detection accuracy, stopping time, and the total probing cost needed for detection. Via simulation results, we analyze the performances and demonstrate that soft actor–critic algorithms are flexible and effective in action selection in imperfectly known environments due to the maximum entropy strategy and they can achieve stable performance levels in challenging test cases (e.g., involving jamming attacks, imperfectly known noise power levels, and high sensing cost scenarios). We also provide comparisons between the performances of the proposed soft actor–critic and conventional actor–critic algorithms as well as fixed scheduling strategies. Finally, we analyze the impact of Byzantine attacks and identify the reliability and accuracy improvements achieved by the GAN-based approach when combined with the SAC-DRL-based decision-making agent.

## Full-text entities

- **Genes:** GAN (gigaxonin) [NCBI Gene 8139] {aka GAN1, GIG, KLHL16}
- **Diseases:** injury to (MESH:D014947)
- **Chemicals:** Byzantine (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12787988/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12787988/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/PMC12787988/full.md

---
Source: https://tomesphere.com/paper/PMC12787988