Extending Information Bottleneck Attribution to Video Sequences

Veronika Solopova; Lucas Schmidt; Dorothea Kolossa

arXiv:2501.16889·cs.CV·January 29, 2025

Extending Information Bottleneck Attribution to Video Sequences

Veronika Solopova, Lucas Schmidt, Dorothea Kolossa

PDF

Open Access 1 Repo

TL;DR

VIBA is a new explainability method for video classification that adapts Information Bottleneck Attribution to highlight manipulated regions and motion inconsistencies in deepfake detection, providing consistent and human-aligned explanations.

Contribution

This work extends IBA to video sequences, enabling explainability in temporal models for video analysis, especially for deepfake detection.

Findings

01

VIBA produces temporally and spatially consistent explanations.

02

VIBA's relevance maps align closely with human annotations.

03

Effective in highlighting manipulated regions and motion inconsistencies.

Abstract

We introduce VIBA, a novel approach for explainable video classification by adapting Information Bottlenecks for Attribution (IBA) to video sequences. While most traditional explainability methods are designed for image models, our IBA framework addresses the need for explainability in temporal models used for video analysis. To demonstrate its effectiveness, we apply VIBA to video deepfake detection, testing it on two architectures: the Xception model for spatial features and a VGG11-based model for capturing motion dynamics through optical flow. Using a custom dataset that reflects recent deepfake generation techniques, we adapt IBA to create relevance and optical flow maps, visually highlighting manipulated regions and motion inconsistencies. Our results show that VIBA generates temporally and spatially consistent explanations, which align closely with human annotations, thus…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anonrep/iba-for-video-sequences
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Visual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques

MethodsResidual Connection · Average Pooling · Depthwise Convolution · Global Average Pooling · Pointwise Convolution · Max Pooling · Depthwise Separable Convolution · ALIGN · Dense Connections · 1x1 Convolution