# Self-Supervised Learning of Face Representations for Video Face   Clustering

**Authors:** Vivek Sharma, Makarand Tapaswi, M.Saquib Sarfraz, Rainer Stiefelhagen

arXiv: 1903.01000 · 2019-03-05

## TL;DR

This paper introduces a self-supervised Siamese network for video face clustering that outperforms existing methods by effectively extracting identity information from deep face representations without requiring video-specific supervision.

## Contribution

The paper presents a novel self-supervised learning approach for face clustering that works on both videos and images, improving clustering accuracy without labeled data.

## Key findings

- Outperforms state-of-the-art on three datasets
- Effective without video/track supervision
- Applicable to both video and image collections

## Abstract

Analyzing the story behind TV series and movies often requires understanding who the characters are and what they are doing. With improving deep face models, this may seem like a solved problem. However, as face detectors get better, clustering/identification needs to be revisited to address increasing diversity in facial appearance. In this paper, we address video face clustering using unsupervised methods. Our emphasis is on distilling the essential information, identity, from the representations obtained using deep pre-trained face networks. We propose a self-supervised Siamese network that can be trained without the need for video/track based supervision, and thus can also be applied to image collections. We evaluate our proposed method on three video face clustering datasets. The experiments show that our methods outperform current state-of-the-art methods on all datasets. Video face clustering is lacking a common benchmark as current works are often evaluated with different metrics and/or different sets of face tracks.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.01000/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1903.01000/full.md

## References

49 references — full list in the complete paper: https://tomesphere.com/paper/1903.01000/full.md

---
Source: https://tomesphere.com/paper/1903.01000