Entropy-driven Unsupervised Keypoint Representation Learning in Videos

Ali Younes; Simone Schaub-Meyer; Georgia Chalvatzaki

arXiv:2209.15404·cs.CV·June 7, 2023

Entropy-driven Unsupervised Keypoint Representation Learning in Videos

Ali Younes, Simone Schaub-Meyer, Georgia Chalvatzaki

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces an unsupervised method for learning meaningful video representations by leveraging local entropy to identify keypoints that capture dynamic information, improving object and scene understanding.

Contribution

The paper proposes a novel entropy-based unsupervised learning framework that discovers spatially and temporally consistent keypoints directly from video frames, guided by two information-theoretic losses.

Findings

01

Outperforms baselines in downstream tasks like object dynamics

02

Effectively captures static and dynamic objects in videos

03

Enhances understanding of scene changes and object movements

Abstract

Extracting informative representations from videos is fundamental for effectively learning various downstream tasks. We present a novel approach for unsupervised learning of meaningful representations from videos, leveraging the concept of image spatial entropy (ISE) that quantifies the per-pixel information in an image. We argue that \textit{local entropy} of pixel neighborhoods and their temporal evolution create valuable intrinsic supervisory signals for learning prominent features. Building on this idea, we abstract visual features into a concise representation of keypoints that act as dynamic information transmitters, and design a deep learning model that learns, purely unsupervised, spatially and temporally consistent representations \textit{directly} from video frames. Two original information-theoretic losses, computed from local entropy, guide our model to discover consistent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

irosa-lab/mint
pytorchOfficial

Videos

Entropy-driven Unsupervised Keypoint Representation Learning in Videos· slideslive

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications