Camouflaged Object Detection with Feature Grafting and Distractor Aware

Yuxuan Song; Xinyue Li; Lin Qi

arXiv:2307.03943·cs.CV·July 11, 2023·2 cites

Camouflaged Object Detection with Feature Grafting and Distractor Aware

Yuxuan Song, Xinyue Li, Lin Qi

PDF

Open Access 1 Repo

TL;DR

This paper introduces FDNet, a novel neural network combining CNN and Transformer encoders with feature grafting and distractor modeling to improve camouflaged object detection, supported by a new large dataset and extensive experiments.

Contribution

The paper proposes a new FDNet architecture with feature grafting and distractor awareness for improved camouflaged object detection, along with the largest dataset ACOD2K.

Findings

01

FDNet outperforms state-of-the-art methods on multiple benchmarks.

02

The cross-attention feature grafting enhances feature integration.

03

The distractor aware module refines camouflage detection accuracy.

Abstract

The task of Camouflaged Object Detection (COD) aims to accurately segment camouflaged objects that integrated into the environment, which is more challenging than ordinary detection as the texture between the target and background is visually indistinguishable. In this paper, we proposed a novel Feature Grafting and Distractor Aware network (FDNet) to handle the COD task. Specifically, we use CNN and Transformer to encode multi-scale images in parallel. In order to better explore the advantages of the two encoders, we design a cross-attention-based Feature Grafting Module to graft features extracted from Transformer branch into CNN branch, after which the features are aggregated in the Feature Fusion Module. A Distractor Aware Module is designed to explicitly model the two possible distractors in the COD task to refine the coarse camouflage map. We also proposed the largest artificial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

syxvision/fdnet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Image Enhancement Techniques · Advanced Image and Video Retrieval Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Dense Connections · Softmax · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Dropout