# Camouflaged object detection via context and texture-aware hierarchical interaction

**Authors:** Zhi Wang, Yangyang Deng, Chenxing Shen, Miaohui Zhang, Xiaoxia Lu

PMC · DOI: 10.1038/s41598-025-32409-9 · 2026-03-18

## TL;DR

This paper introduces a new network for detecting camouflaged objects by combining context and texture information, achieving better performance than existing methods.

## Contribution

The novel CTHINet uses texture labels and hierarchical interaction modules to enhance detection accuracy in camouflaged object detection.

## Key findings

- CTHINet outperforms state-of-the-art methods on benchmark COD datasets like CAMO, COD10K, and NC4K.
- The model's performance on the polyp segmentation dataset suggests its potential for broader applications.
- Hierarchical interaction modules improve the integration of texture and context cues for accurate detection.

## Abstract

In the field of camouflaged object detection (COD), effectively distinguishing the intrinsic similarity between objects and their backgrounds is a critical factor for improving detection performance. Existing approaches typically leverage boundary constraints to provide additional auxiliary information during the training phase. To capture more discriminative detailed cues, we introduce texture labels as supervisory signals and propose a context- and texture-aware hierarchical interaction network (CTHINet) for COD. In the coding phase, the network is divided into two separate branches, a context and a texture encoder. Specifically, a context encoder is employed to generate contextual information. Subsequently, the features at different scales are refined by implementing a Multi-head Feature Aggregation Module (MFAM). The diversity of features is subsequently enhanced by leveraging the interactions among their distinct feature receptive fields, facilitating the matching of candidate areas for camouflaged objects with varying sizes and shapes. Following this, the enhanced features are combined with texture features generated by the texture encoder, fully exploiting imperceptible cues within candidate objects through utilizing the Hierarchical mixed-scale Interaction Modules (HMIM). This module continuously integrates texture cues with contextual information within a single feature scale, aiming for more accurate detection. Extensive experiments conducted on three challenging benchmark datasets, e.g., CAMO, COD10K, and NC4K, illustrate that our model has superior performance compared to state-of-the-art methods. Furthermore, the evaluation results on the polyp segmentation dataset underscore the promising potential of CTHINet for downstream applications.

## Full-text entities

- **Genes:** A1BG (alpha-1-B glycoprotein) [NCBI Gene 1] {aka A1B, ABG, GAB, HYST2477}
- **Diseases:** COVID-19 (MESH:D000086382), COD (MESH:D014012), CAMO (MESH:C537548), lung infection (MESH:D012141), polyp (MESH:D011127), HMIM (MESH:C538175)
- **Chemicals:** MFAM (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13000261/full.md

---
Source: https://tomesphere.com/paper/PMC13000261