Enhancing Noise Resilience in Face Clustering via Sparse Differential Transformer
Dafeng Zhang, Yongqi Song, Shizhuo Liu

TL;DR
This paper introduces a Sparse Differential Transformer to improve noise resilience in face clustering by enhancing similarity measurement accuracy and reducing irrelevant node influence, leading to state-of-the-art results.
Contribution
We propose a novel Sparse Differential Transformer that effectively eliminates noise and improves the reliability of similarity measurements in face clustering.
Findings
Achieves state-of-the-art performance on MS-Celeb-1M dataset.
Outperforms existing methods in clustering accuracy.
Enhances robustness against noise in face embedding relationships.
Abstract
The method used to measure relationships between face embeddings plays a crucial role in determining the performance of face clustering. Existing methods employ the Jaccard similarity coefficient instead of the cosine distance to enhance the measurement accuracy. However, these methods introduce too many irrelevant nodes, producing Jaccard coefficients with limited discriminative power and adversely affecting clustering performance. To address this issue, we propose a prediction-driven Top-K Jaccard similarity coefficient that enhances the purity of neighboring nodes, thereby improving the reliability of similarity measurements. Nevertheless, accurately predicting the optimal number of neighbors (Top-K) remains challenging, leading to suboptimal clustering results. To overcome this limitation, we develop a Transformer-based prediction model that examines the relationships between the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsFace recognition and analysis · Face and Expression Recognition · Emotion and Mood Recognition
