TL;DR
This paper introduces a dynamic multi-task deep learning approach for caricature-visual face recognition, effectively handling extreme distortions and improving recognition accuracy over existing methods.
Contribution
It proposes a novel dynamic multi-task learning framework that adjusts task weights during training, enhancing performance in cross-modal caricature-visual face recognition.
Findings
Outperforms state-of-the-art methods on CaVI and WebCaricature datasets.
Demonstrates improved recognition accuracy with dynamic task weighting.
Effective handling of non-rigid caricature distortions.
Abstract
Rather than the visual images, the face recognition of the caricatures is far from the performance of the visual images. The challenge is the extreme non-rigid distortions of the caricatures introduced by exaggerating the facial features to strengthen the characters. In this paper, we propose dynamic multi-task learning based on deep CNNs for cross-modal caricature-visual face recognition. Instead of the conventional multi-task learning with fixed weights of the tasks, the proposed dynamic multi-task learning dynamically updates the weights of tasks according to the importance of the tasks, which enables the training of the networks focus on the hard task instead of being stuck in the overtraining of the easy task. The experimental results demonstrate the effectiveness of the dynamic multi-task learning for caricature-visual face recognition. The performance evaluated on the datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
