# ATDMNet: Multi-Head Agent Attention and Top-k Dynamic Mask for Camouflaged Object Detection

**Authors:** Rui Fu, Yuehui Li, Chih-Cheng Chen, Yile Duan, Pengjian Yao, Kaixin Zhou

PMC · DOI: 10.3390/s25103001 · Sensors (Basel, Switzerland) · 2025-05-09

## TL;DR

This paper introduces ATDMNet, a new method for detecting camouflaged objects using a combination of CNNs and transformers with attention mechanisms.

## Contribution

The novel ATDMNet architecture integrates multi-head agent attention and top-k dynamic mask for improved camouflaged object detection.

## Key findings

- ATDMNet outperforms existing methods in COD datasets like NC4K and COD10K.
- The use of MHA and TDM improves feature sensitivity and segmentation accuracy.

## Abstract

Camouflaged object detection (COD) encounters substantial difficulties owing to the visual resemblance between targets and their environments, together with discrepancies in multiscale representation of features. Current methodologies confront obstacles with feature distraction, modeling far-reaching dependencies, fusing multiple-scale details, and extracting boundary specifics. Consequently, we propose ATDMNet, an amalgamated architecture combining CNN and transformer within a numerous phases feature extraction framework. ATDMNet employs Res2Net as the foundational encoder and incorporates two essential components: multi-head agent attention (MHA) and top-k dynamic mask (TDM). MHA improves local feature sensitivity and long-range dependency modeling by incorporating agent nodes and positional biases, whereas TDM boosts attention with top-k operations and multiscale dynamic methods. The decoding phase utilizes bilinear upsampling and sophisticated semantic guidance to enhance low-level characteristics, hence ensuring precise segmentation. Enhanced performance is achieved by deep supervision and a hybrid loss function. Experiments applying COD datasets (NC4K, COD10K, CAMO) demonstrate that ATDMNet establishes a new benchmark in both precision and efficiency.

## Full-text entities

- **Diseases:** injury to (MESH:D014947), COD (MESH:D014012)
- **Chemicals:** V (MESH:D014639), K (MESH:D011188)
- **Species:** Zootoca vivipara (common lizard, species) [taxon 8524], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12114713/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12114713/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/PMC12114713/full.md

---
Source: https://tomesphere.com/paper/PMC12114713