Interpretable Face Manipulation Detection via Feature Whitening

Yingying Hua; Daichi Zhang; Pengju Wang; Shiming Ge

arXiv:2106.10834·cs.CV·June 22, 2021

Interpretable Face Manipulation Detection via Feature Whitening

Yingying Hua, Daichi Zhang, Pengju Wang, Shiming Ge

PDF

Open Access

TL;DR

This paper introduces an interpretable face manipulation detection method that enhances trustworthiness by making the detection process transparent through feature whitening, balancing accuracy and interpretability.

Contribution

It proposes a novel feature whitening module that decorrelates features to improve interpretability without sacrificing detection accuracy.

Findings

01

Balances detection accuracy and interpretability

02

Feature whitening improves model transparency

03

Achieves trustworthy face manipulation detection

Abstract

Why should we trust the detections of deep neural networks for manipulated faces? Understanding the reasons is important for users in improving the fairness, reliability, privacy and trust of the detection models. In this work, we propose an interpretable face manipulation detection approach to achieve the trustworthy and accurate inference. The approach could make the face manipulation detection process transparent by embedding the feature whitening module. This module aims to whiten the internal working mechanism of deep networks through feature decorrelation and feature constraint. The experimental results demonstrate that our proposed approach can strike a balance between the detection accuracy and the model interpretability.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Face recognition and analysis · Generative Adversarial Networks and Image Synthesis