Learning Implicit Temporal Alignment for Few-shot Video Classification

Songyang Zhang; Jiale Zhou; Xuming He

arXiv:2105.04823·cs.CV·May 12, 2021·1 cites

Learning Implicit Temporal Alignment for Few-shot Video Classification

Songyang Zhang, Jiale Zhou, Xuming He

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel implicit temporal alignment method for few-shot video classification, improving similarity estimation between video pairs and enhancing generalization across classes.

Contribution

It proposes a matching-based strategy with implicit temporal alignment and a context encoding module, advancing few-shot video classification techniques.

Findings

01

Outperforms prior methods on SomethingSomething-V2

02

Achieves competitive results on Kinetics

03

Demonstrates robust similarity estimation in few-shot settings

Abstract

Few-shot video classification aims to learn new video categories with only a few labeled examples, alleviating the burden of costly annotation in real-world applications. However, it is particularly challenging to learn a class-invariant spatial-temporal representation in such a setting. To address this, we propose a novel matching-based few-shot learning strategy for video sequences in this work. Our main idea is to introduce an implicit temporal alignment for a video pair, capable of estimating the similarity between them in an accurate and robust manner. Moreover, we design an effective context encoding module to incorporate spatial and feature channel context, resulting in better modeling of intra-class variations. To train our model, we develop a multi-task loss for learning video matching, leading to video features with better generalization. Extensive experimental results on two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tonysy/PyAction
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Cancer-related molecular mechanisms research · Video Analysis and Summarization