A Challenging Benchmark of Anime Style Recognition
Haotang Li, Shengtao Guo, Kailin Lyu, Xiao Yang, Tianchen Chen, Jianqing Zhu, Huanqiang Zeng

TL;DR
This paper introduces a large-scale, challenging benchmark dataset for anime style recognition, highlighting the difficulty of learning abstract painting styles across diverse images and evaluating current methods' performance.
Contribution
The paper presents a new large-scale dataset (LSASRD) for anime style recognition, along with a cross-role evaluation protocol and baseline results using advanced re-identification models.
Findings
Transformers achieve only 42.24% mAP on LSASRD.
The task presents a significant semantic gap, indicating the need for further research.
Current models struggle with the dataset's complexity and variability.
Abstract
Given two images of different anime roles, anime style recognition (ASR) aims to learn abstract painting style to determine whether the two images are from the same work, which is an interesting but challenging problem. Unlike biometric recognition, such as face recognition, iris recognition, and person re-identification, ASR suffers from a much larger semantic gap but receives less attention. In this paper, we propose a challenging ASR benchmark. Firstly, we collect a large-scale ASR dataset (LSASRD), which contains 20,937 images of 190 anime works and each work at least has ten different roles. In addition to the large-scale, LSASRD contains a list of challenging factors, such as complex illuminations, various poses, theatrical colors and exaggerated compositions. Secondly, we design a cross-role protocol to evaluate ASR performance, in which query and gallery images must come from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Handwritten Text Recognition Techniques · Generative Adversarial Networks and Image Synthesis
