Dynamic Deep Multi-task Learning for Caricature-Visual Face Recognition

Zuheng Ming; Jean-Christophe Burie; Muhammad Muzzamil Luqman

arXiv:1911.03341·cs.CV·November 11, 2019

Dynamic Deep Multi-task Learning for Caricature-Visual Face Recognition

Zuheng Ming, Jean-Christophe Burie, Muhammad Muzzamil Luqman

PDF

1 Repo

TL;DR

This paper introduces a dynamic multi-task deep learning approach for caricature-visual face recognition, effectively handling extreme distortions and improving recognition accuracy over existing methods.

Contribution

It proposes a novel dynamic multi-task learning framework that adjusts task weights during training, enhancing performance in cross-modal caricature-visual face recognition.

Findings

01

Outperforms state-of-the-art methods on CaVI and WebCaricature datasets.

02

Demonstrates improved recognition accuracy with dynamic task weighting.

03

Effective handling of non-rigid caricature distortions.

Abstract

Rather than the visual images, the face recognition of the caricatures is far from the performance of the visual images. The challenge is the extreme non-rigid distortions of the caricatures introduced by exaggerating the facial features to strengthen the characters. In this paper, we propose dynamic multi-task learning based on deep CNNs for cross-modal caricature-visual face recognition. Instead of the conventional multi-task learning with fixed weights of the tasks, the proposed dynamic multi-task learning dynamically updates the weights of tasks according to the importance of the tasks, which enables the training of the networks focus on the hard task instead of being stuck in the overtraining of the easy task. The experimental results demonstrate the effectiveness of the dynamic multi-task learning for caricature-visual face recognition. The performance evaluated on the datasets…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hengxyz/cari-visual-recognition-via-multitask-learning
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.