GLCONet: Learning Multi-source Perception Representation for Camouflaged Object Detection

Yanguang Sun; Hanyu Xuan; Jian Yang; Lei Luo

arXiv:2409.09588 · cs.CV · September 17, 2024 · 3 cites

TL;DR

GLCONet introduces a global-local collaborative framework for camouflaged object detection, leveraging multi-source perception to improve feature representation and outperform existing methods.

Contribution

The paper proposes a global-local collaborative optimization strategy and an adjacent reverse decoder to enhance feature discrimination and integration in camouflaged object detection (COD).

Findings

01. Outperforms twenty state-of-the-art methods on three datasets.

02. Effectively activates significant pixels in images.

03. Compatible with different backbone networks.

Abstract

Recently, biological perception has been a powerful tool for handling the camouflaged object detection (COD) task. However, most existing methods are heavily dependent on the local spatial information of diverse scales from convolutional operations to optimize initial features. A commonly neglected point in these methods is the long-range dependencies between feature pixels from different scale spaces that can help the model build a global structure of the object, inducing a more precise image representation. In this paper, we propose a novel Global-Local Collaborative Optimization Network, called GLCONet. Technically, we first design a collaborative optimization strategy from the perspective of multi-source perception to simultaneously model the local details and global long-range relationships, which can provide features with abundant discriminative information to boost the accuracy…
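The idea of modeling local details and global long-range relationships at the same time can be illustrated with a toy sketch. This is not the GLCONet implementation: it works on a 1D list of scalar features in plain Python, and the names `local_branch`, `global_branch`, and `collaborate`, the window size, and the residual-sum fusion are all assumptions made for illustration. The local branch stands in for a convolution-style neighbourhood operation, and the global branch stands in for attention over all positions.

```python
import math


def local_branch(x, k=3):
    """Local view: average each position over a window of size k (clipped at the edges)."""
    r = k // 2
    out = []
    for i in range(len(x)):
        window = x[max(0, i - r): i + r + 1]
        out.append(sum(window) / len(window))
    return out


def global_branch(x):
    """Global view: scalar self-attention, so every position attends to all positions."""
    out = []
    for q in x:
        scores = [q * key for key in x]
        m = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - m) for s in scores]
        z = sum(weights)
        out.append(sum(w * v for w, v in zip(weights, x)) / z)
    return out


def collaborate(x):
    """Assumed fusion: residual sum of the input, the local view, and the global view."""
    loc = local_branch(x)
    glo = global_branch(x)
    return [xi + l + g for xi, l, g in zip(x, loc, glo)]


if __name__ == "__main__":
    features = [0.0, 0.0, 1.0, 0.0, 0.0]
    print(local_branch(features))
    print(global_branch(features))
    print(collaborate(features))
```

In this toy input, the local branch only spreads the single active value to its immediate neighbours, while the global branch gives every position a contribution from the whole sequence, including positions far from the active one. That is the complementary behaviour the abstract attributes to combining local and long-range information.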

Code & Models

Repositories

csysi/glconet (PyTorch, official)

Taxonomy

Topics: Visual Attention and Saliency Detection · Video Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques