Improving Spatiotemporal Self-Supervision by Deep Reinforcement Learning

Uta Büchler; Biagio Brattoli; Björn Ommer

arXiv:1807.11293 · cs.CV · July 31, 2018 · 1 citation

TL;DR

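This paper introduces a sampling policy, learned with deep reinforcement learning, for self-supervised training of CNNs on spatial and temporal ordering tasks. The policy selects training permutations according to the current state of the network rather than at random, with the aim of improving the features learned for image and video tasks.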

Contribution

The paper proposes a deep reinforcement learning approach that adaptively samples permutations for self-supervision by ordering, replacing the random sampling from a fixed preselected set used so far.

Findings

01. Achieves competitive results on image classification benchmarks.

02. Demonstrates improved feature representations for video classification.

03. Enhances unsupervised and transfer learning performance.

Abstract

Self-supervised learning of convolutional neural networks can harness large amounts of cheap unlabeled data to train powerful feature representations. As a surrogate task, we jointly address the ordering of visual data in the spatial and temporal domains. The permutations of training samples, which are at the core of self-supervision by ordering, have so far been sampled randomly from a fixed preselected set. Based on deep reinforcement learning, we propose a sampling policy that adapts to the state of the network being trained. New permutations are therefore sampled according to their expected utility for updating the convolutional feature representation. Experimental evaluation on unsupervised and transfer learning tasks demonstrates competitive performance on standard benchmarks for image and video classification and nearest neighbor retrieval.
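
To make the contrast between random and adaptive permutation sampling concrete, the following is a minimal illustrative sketch, not the authors' method. It swaps the paper's deep reinforcement learning policy for a much simpler bandit-style heuristic: each permutation in a fixed set keeps a running utility score, and permutations are drawn from a softmax over those scores. The class `AdaptivePermutationSampler`, the helper `shuffle_parts`, the `temperature` and `momentum` parameters, and the use of a per-permutation error as the utility signal are all assumptions introduced here for illustration.

```python
import math
import random
from itertools import permutations


def softmax(scores, temperature=1.0):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp((s - m) / temperature) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def shuffle_parts(parts, perm):
    """Reorder image tiles or video frames according to a permutation."""
    return [parts[i] for i in perm]


class AdaptivePermutationSampler:
    """Hypothetical bandit-style stand-in for a learned sampling policy.

    Assumption: a permutation is more useful when the network currently
    makes larger errors on it. The paper learns its policy with deep
    reinforcement learning; this class only mimics the general idea.
    """

    def __init__(self, perms, temperature=1.0, momentum=0.9, seed=0):
        self.perms = list(perms)
        self.utility = [0.0] * len(self.perms)
        self.temperature = temperature
        self.momentum = momentum
        self.rng = random.Random(seed)

    def probabilities(self):
        """Sampling distribution over the fixed permutation set."""
        return softmax(self.utility, self.temperature)

    def sample(self):
        """Draw one permutation index and the permutation itself."""
        idx = self.rng.choices(
            range(len(self.perms)), weights=self.probabilities(), k=1
        )[0]
        return idx, self.perms[idx]

    def update(self, idx, observed_error):
        """Blend the newly observed error into the running utility."""
        self.utility[idx] = (
            self.momentum * self.utility[idx]
            + (1.0 - self.momentum) * observed_error
        )


# A small fixed set of permutations over four parts (illustrative size).
fixed_set = list(permutations(range(4)))[:8]
sampler = AdaptivePermutationSampler(fixed_set, temperature=0.5)

for step in range(20):
    idx, perm = sampler.sample()
    shuffled = shuffle_parts(["p0", "p1", "p2", "p3"], perm)
    # Placeholder signal: in a real setup this would come from the
    # network being trained, e.g. its error at predicting `idx`.
    fake_error = 1.0 if idx == 3 else 0.1
    sampler.update(idx, fake_error)
```

With all utilities at zero the sampler is equivalent to uniform random sampling from the fixed set. As errors are observed, probability mass shifts toward the permutations that the network currently handles worst, which is one simple reading of sampling by "expected utility".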

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · AI in cancer detection · Multimodal Machine Learning Applications