# Domain-Specific Priors and Meta Learning for Few-Shot First-Person   Action Recognition

**Authors:** Huseyin Coskun, Zeeshan Zia, Bugra Tekin, Federica Bogo, Nassir Navab,, Federico Tombari, Harpreet Sawhney

arXiv: 1907.09382 · 2021-12-09

## TL;DR

This paper introduces a meta-learning approach utilizing domain-specific visual cues for effective few-shot first-person action recognition across diverse datasets, addressing the challenge of limited annotated data.

## Contribution

It proposes a novel meta-learning framework that leverages local visual cues for transfer learning in first-person action recognition, enabling cross-dataset generalization with minimal examples.

## Key findings

- Outperforms state-of-the-art methods in few-shot transfer scenarios.
- Effective transfer across diverse datasets with different scene and action configurations.
- Utilizes domain-invariant visual cues for robust action classification.

## Abstract

The lack of large-scale real datasets with annotations makes transfer learning a necessity for video activity understanding. We aim to develop an effective method for few-shot transfer learning for first-person action classification. We leverage independently trained local visual cues to learn representations that can be transferred from a source domain, which provides primitive action labels, to a different target domain using only a handful of examples. Visual cues we employ include object-object interactions, hand grasps and motion within regions that are a function of hand locations. We employ a framework based on meta-learning to extract the distinctive and domain invariant components of the deployed visual cues. This enables transfer of action classification models across public datasets captured with diverse scene and action configurations. We present comparative results of our transfer learning methodology and report superior results over state-of-the-art action classification approaches for both inter-class and inter-dataset transfer.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.09382/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1907.09382/full.md

## References

87 references — full list in the complete paper: https://tomesphere.com/paper/1907.09382/full.md

---
Source: https://tomesphere.com/paper/1907.09382