Multi-Task Zero-Shot Action Recognition with Prioritised Data   Augmentation

Xun Xu; Timothy M. Hospedales; Shaogang Gong

arXiv:1611.08663·cs.CV·November 29, 2016

Multi-Task Zero-Shot Action Recognition with Prioritised Data Augmentation

Xun Xu, Timothy M. Hospedales, Shaogang Gong

PDF

TL;DR

This paper enhances zero-shot action recognition by developing a multi-task visual-semantic mapping and a dynamic data re-weighting strategy to better handle domain shift and improve generalisation to new classes.

Contribution

It introduces a multi-task visual-semantic mapping constrained to a low-dimensional manifold and a prioritised data augmentation method for improved zero-shot learning.

Findings

01

Improved generalisation in zero-shot action recognition.

02

Enhanced accuracy over existing ZSL models.

03

Effective handling of domain shift through data prioritisation.

Abstract

Zero-Shot Learning (ZSL) promises to scale visual recognition by bypassing the conventional model training requirement of annotated examples for every category. This is achieved by establishing a mapping connecting low-level features and a semantic description of the label space, referred as visual-semantic mapping, on auxiliary data. Reusing the learned mapping to project target videos into an embedding space thus allows novel-classes to be recognised by nearest neighbour inference. However, existing ZSL methods suffer from auxiliary-target domain shift intrinsically induced by assuming the same mapping for the disjoint auxiliary and target classes. This compromises the generalisation accuracy of ZSL recognition on the target data. In this work, we improve the ability of ZSL to generalise across this domain shift in both model- and data-centric ways by formulating a visual-semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.