Domain-Adaptive Self-Supervised Pre-Training for Face & Body Detection in Drawings
Bar{\i}\c{s} Batuhan Topal, Deniz Yuret, Tevfik Metin Sezgin

TL;DR
This paper introduces a domain-adaptive self-supervised pre-training method for face and body detection in drawings, leveraging unlabeled data and style transfer to achieve state-of-the-art results with minimal annotations.
Contribution
It proposes a novel teacher-student self-supervised learning framework with style transfer integration for detecting faces and bodies in diverse drawing styles.
Findings
Achieves state-of-the-art detection performance with limited labeled data.
Effectively utilizes unlabeled target domain data for training.
Incorporates style transfer to leverage out-of-domain labeled images.
Abstract
Drawings are powerful means of pictorial abstraction and communication. Understanding diverse forms of drawings, including digital arts, cartoons, and comics, has been a major problem of interest for the computer vision and computer graphics communities. Although there are large amounts of digitized drawings from comic books and cartoons, they contain vast stylistic variations, which necessitate expensive manual labeling for training domain-specific recognizers. In this work, we show how self-supervised learning, based on a teacher-student network with a modified student network update design, can be used to build face and body detectors. Our setup allows exploiting large amounts of unlabeled data from the target domain when labels are provided for only a small subset of it. We further demonstrate that style transfer can be incorporated into our learning pipeline to bootstrap detectors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Human Pose and Action Recognition · Video Surveillance and Tracking Methods
