Frequency-Spatial Entanglement Learning for Camouflaged Object Detection

Yanguang Sun; Chunyan Xu; Jian Yang; Hanyu Xuan; and Lei Luo

arXiv:2409.01686·cs.CV·September 4, 2024·5 cites

Frequency-Spatial Entanglement Learning for Camouflaged Object Detection

Yanguang Sun, Chunyan Xu, Jian Yang, Hanyu Xuan, and Lei Luo

PDF

Open Access 1 Repo

TL;DR

This paper introduces Frequency-Spatial Entanglement Learning (FSEL), a novel method combining frequency and spatial domain features to improve camouflaged object detection, outperforming existing approaches.

Contribution

The paper proposes a new FSEL approach with Entanglement Transformer Blocks and dual-domain feature integration for better camouflaged object detection.

Findings

01

FSEL outperforms 21 state-of-the-art methods.

02

Extensive experiments validate the effectiveness of the approach.

03

The method achieves superior results on three benchmark datasets.

Abstract

Camouflaged object detection has attracted a lot of attention in computer vision. The main challenge lies in the high degree of similarity between camouflaged objects and their surroundings in the spatial domain, making identification difficult. Existing methods attempt to reduce the impact of pixel similarity by maximizing the distinguishing ability of spatial features with complicated design, but often ignore the sensitivity and locality of features in the spatial domain, leading to sub-optimal results. In this paper, we propose a new approach to address this issue by jointly exploring the representation in the frequency and spatial domains, introducing the Frequency-Spatial Entanglement Learning (FSEL) method. This method consists of a series of well-designed Entanglement Transformer Blocks (ETB) for representation learning, a Joint Domain Perception Module for semantic enhancement,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

csysi/fsel
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Image Enhancement Techniques · Advanced Vision and Imaging

MethodsByte Pair Encoding · Absolute Position Encodings · Softmax · Label Smoothing · Linear Layer · Adam · Dropout · Layer Normalization · Dense Connections · Attention Is All You Need