# A Federated Hierarchical DQN-Based Distributed Intelligent Anti-Jamming Method for UAVs

**Authors:** Dadong Ni, Shuo Ma, Junyi Du, Yuansheng Wu, Chengxu Zhou, Haitao Xiao

PMC · DOI: 10.3390/s26010181 · Sensors (Basel, Switzerland) · 2025-12-26

## TL;DR

This paper introduces a new method for UAVs to work together and avoid communication jamming using advanced AI techniques.

## Contribution

The paper proposes FL-HDQN, a novel federated learning-based hierarchical DQN method for multi-UAV anti-jamming.

## Key findings

- FL-HDQN ensures decision consistency and reduces communication costs by sharing model parameters, not raw data.
- The hierarchical model improves real-time performance and decision effectiveness in multi-domain interference scenarios.
- Simulations show FL-HDQN achieves 1% higher decision accuracy than existing anti-jamming models.

## Abstract

In recent years, with the rapid development of intelligent communication technologies, anti-jamming techniques based on deep learning have been widely adopted in unmanned aerial vehicle (UAV) systems, yielding significant improvements. Most existing studies primarily focus on intelligent anti-jamming decision-making for single UAVs. However, in UAV swarm systems, single-agent decision models often suffer from data isolation and inconsistent frequency usage decisions among nodes within the same task subnet, caused by asynchronous model updates. Although data sharing among UAVs can partially alleviate model update issues, it introduces significant communication overhead and data security challenges. To address these problems, this paper proposes a novel multi-UAV cooperative intelligent anti-jamming decision-making method, termed Federated Learning-Hierarchical Deep Q-Network (FL-HDQN). First, an adaptive model synchronization mechanism is integrated into the federated learning framework. By sharing only local model parameters instead of raw data, UAVs collaboratively train a global model for each task subnet. This approach ensures decision consistency while preserving data privacy and reducing communication costs. Second, to overcome the curse of dimensionality caused by multi-domain interference parameters, a hierarchical deep reinforcement learning model is designed. The model decouples multi-domain optimization into two levels: the first layer performs time–frequency domain decisions, and the second layer conducts power and modulation-coding domain decisions, ensuring both real-time performance and decision effectiveness. Finally, simulation results demonstrate that, compared with state-of-the-art intelligent anti-jamming models, the proposed method achieves 1% higher decision accuracy, validating its superiority and effectiveness.

## Full-text entities

- **Diseases:** injury to (MESH:D014947)
- **Chemicals:** MDJ (MESH:C501952), FL (MESH:D005459), HDQN (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12788220/full.md

## Figures

17 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12788220/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/PMC12788220/full.md

---
Source: https://tomesphere.com/paper/PMC12788220