TL;DR
This paper introduces GraphJigsaw, a novel self-supervised learning method that constructs and solves shape-based jigsaw puzzles within CNNs using GCNs, significantly improving cartoon face recognition accuracy.
Contribution
The paper proposes GraphJigsaw, a new approach that integrates jigsaw puzzles and graph convolutional networks into CNNs for better shape pattern learning in cartoon face recognition.
Findings
GraphJigsaw outperforms existing methods on cartoon face datasets.
It effectively learns shape-oriented representations without extra annotations.
The method improves recognition accuracy with no additional inference cost.
Abstract
Cartoon face recognition is challenging as they typically have smooth color regions and emphasized edges, the key to recognize cartoon faces is to precisely perceive their sparse and critical shape patterns. However, it is quite difficult to learn a shape-oriented representation for cartoon face recognition with convolutional neural networks (CNNs). To mitigate this issue, we propose the GraphJigsaw that constructs jigsaw puzzles at various stages in the classification network and solves the puzzles with the graph convolutional network (GCN) in a progressive manner. Solving the puzzles requires the model to spot the shape patterns of the cartoon faces as the texture information is quite limited. The key idea of GraphJigsaw is constructing a jigsaw puzzle by randomly shuffling the intermediate convolutional feature maps in the spatial dimension and exploiting the GCN to reason and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsJigsaw · Graph Convolutional Network
