CAST: Character labeling in Animation using Self-supervision by Tracking
Oron Nir, Gal Rapoport, Ariel Shamir

TL;DR
This paper introduces a self-supervised method to improve character labeling in animation videos by refining semantic representations tailored to specific styles, enabling more accurate clustering and classification of animated characters.
Contribution
The authors propose a novel self-supervised approach that refines semantic embeddings for animated content, facilitating automatic character labeling across diverse animation styles.
Findings
Enhanced clustering of animated characters in refined semantic space
Effective creation of character dictionaries with minimal user effort
Improved labeling accuracy across various animation styles
Abstract
Cartoons and animation domain videos have very different characteristics compared to real-life images and videos. In addition, this domain carries a large variability in styles. Current computer vision and deep-learning solutions often fail on animated content because they were trained on natural images. In this paper we present a method to refine a semantic representation suitable for specific animated content. We first train a neural network on a large-scale set of animation videos and use the mapping to deep features as an embedding space. Next, we use self-supervision to refine the representation for any specific animation style by gathering many examples of animated characters in this style, using a multi-object tracking. These examples are used to define triplets for contrastive loss training. The refined semantic space allows better clustering of animated characters even when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Video Analysis and Summarization · Image Retrieval and Classification Techniques
MethodsSupervised Contrastive Loss
