CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models
Kushal Kumar Jain, Steve Grosz, Anoop M. Namboodiri, Anil K. Jain

TL;DR
This paper introduces CLIP4Sketch, a method that uses diffusion models to generate diverse, high-quality sketches from mugshots, significantly improving face recognition accuracy in forensic sketch matching tasks.
Contribution
We propose a novel diffusion model-based approach to generate synthetic sketches, enhancing dataset diversity and face recognition performance in sketch-to-mugshot matching.
Findings
Synthetic sketches improve recognition accuracy.
Diffusion-generated data outperforms GAN-based datasets.
Enhanced dataset leads to better forensic face matching.
Abstract
Forensic sketch-to-mugshot matching is a challenging task in face recognition, primarily hindered by the scarcity of annotated forensic sketches and the modality gap between sketches and photographs. To address this, we propose CLIP4Sketch, a novel approach that leverages diffusion models to generate a large and diverse set of sketch images, which helps in enhancing the performance of face recognition systems in sketch-to-mugshot matching. Our method utilizes Denoising Diffusion Probabilistic Models (DDPMs) to generate sketches with explicit control over identity and style. We combine CLIP and Adaface embeddings of a reference mugshot, along with textual descriptions of style, as the conditions to the diffusion model. We demonstrate the efficacy of our approach by generating a comprehensive dataset of sketches corresponding to mugshots and training a face recognition model on our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training · Contrastive Language-Image Pre-training · Diffusion
