Robust Character Labeling in Movie Videos: Data Resources and   Self-supervised Feature Adaptation

Krishna Somandepalli; Rajat Hebbar; Shrikanth Narayanan

arXiv:2008.11289·cs.CV·March 1, 2022

Robust Character Labeling in Movie Videos: Data Resources and Self-supervised Feature Adaptation

Krishna Somandepalli, Rajat Hebbar, Shrikanth Narayanan

PDF

TL;DR

This paper introduces a large-scale dataset and novel adaptation methods for robust face clustering in movies, addressing domain-specific challenges and improving face verification and clustering performance.

Contribution

It provides a new dataset of 169,000 face tracks from movies and proposes multiview correlation-based adaptation for better face embedding robustness.

Findings

01

Multiview correlation-based adaptation improves face embedding discriminability.

02

Weakly labeled data effectively enhances domain-specific feature adaptation.

03

The developed resources facilitate future research in character labeling in videos.

Abstract

Robust face clustering is a vital step in enabling computational understanding of visual character portrayal in media. Face clustering for long-form content is challenging because of variations in appearance and lack of supporting large-scale labeled data. Our work in this paper focuses on two key aspects of this problem: the lack of domain-specific training or benchmark datasets, and adapting face embeddings learned on web images to long-form content, specifically movies. First, we present a dataset of over 169,000 face tracks curated from 240 Hollywood movies with weak labels on whether a pair of face tracks belong to the same or a different character. We propose an offline algorithm based on nearest-neighbor search in the embedding space to mine hard-examples from these tracks. We then investigate triplet-loss and multiview correlation-based methods for adapting face embeddings to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.