Utilizing Grounded SAM for self-supervised frugal camouflaged human   detection

Matthias Pijarowski; Alexander Wolpert; Martin Heckmann; Michael; Teutsch

arXiv:2406.05776·cs.CV·June 11, 2024

Utilizing Grounded SAM for self-supervised frugal camouflaged human detection

Matthias Pijarowski, Alexander Wolpert, Martin Heckmann, Michael, Teutsch

PDF

TL;DR

This paper explores self-supervised and frugal learning methods for camouflaged human detection, fine-tuning existing models with limited labeled data, and demonstrates comparable performance to fully supervised approaches.

Contribution

It introduces self-supervised pseudo-labeling techniques for camouflaged human detection, reducing reliance on large labeled datasets and adapting existing models effectively.

Findings

01

Self-supervised methods achieve similar performance to supervised ones.

02

Frugal transfer learning effectively fine-tunes models for camouflaged human detection.

03

Pseudo-labeling approaches are viable for reducing annotation effort.

Abstract

Visually detecting camouflaged objects is a hard problem for both humans and computer vision algorithms. Strong similarities between object and background appearance make the task significantly more challenging than traditional object detection or segmentation tasks. Current state-of-the-art models use either convolutional neural networks or vision transformers as feature extractors. They are trained in a fully supervised manner and thus need a large amount of labeled training data. In this paper, both self-supervised and frugal learning methods are introduced to the task of Camouflaged Object Detection (COD). The overall goal is to fine-tune two COD reference methods, namely SINet-V2 and HitNet, pre-trained for camouflaged animal detection to the task of camouflaged human detection. Therefore, we use the public dataset CPD1K that contains camouflaged humans in a forest environment. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.