Contrastive Video Representation Learning via Adversarial Perturbations

Jue Wang; Anoop Cherian

arXiv:1807.09380·cs.CV·April 17, 2020·1 cites

Contrastive Video Representation Learning via Adversarial Perturbations

Jue Wang, Anoop Cherian

PDF

Open Access

TL;DR

This paper introduces a novel contrastive learning method for video representations that uses adversarial perturbations to generate negative samples, resulting in more robust and discriminative video features.

Contribution

It proposes a new contrastive learning framework utilizing adversarial perturbations to create negative samples and a discriminative subspace pooling method for improved video representation.

Findings

01

Achieves state-of-the-art results on multiple video datasets.

02

Demonstrates robustness of learned representations against adversarial noise.

03

Introduces a subspace learning approach on the Stiefel manifold for video descriptors.

Abstract

Adversarial perturbations are noise-like patterns that can subtly change the data, while failing an otherwise accurate classifier. In this paper, we propose to use such perturbations within a novel contrastive learning setup to build negative samples, which are then used to produce improved video representations. To this end, given a well-trained deep model for per-frame video recognition, we first generate adversarial noise adapted to this model. Positive and negative bags are produced using the original data features from the full video sequence and their perturbed counterparts, respectively. Unlike the classic contrastive learning methods, we develop a binary classification problem that learns a set of discriminative hyperplanes -- as a subspace -- that will separate the two bags from each other. This subspace is then used as a descriptor for the video, dubbed \emph{discriminative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning

MethodsAverage Pooling · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling