What Actions are Needed for Understanding Human Actions in Videos?

Gunnar A. Sigurdsson; Olga Russakovsky; Abhinav Gupta

arXiv:1708.02696·cs.CV·August 10, 2017

What Actions are Needed for Understanding Human Actions in Videos?

Gunnar A. Sigurdsson, Olga Russakovsky, Abhinav Gupta

PDF

1 Repo

TL;DR

This paper analyzes current methods, datasets, and metrics for human activity understanding in videos, highlighting challenges and promising directions for improving algorithmic accuracy through integrated object, pose, and temporal reasoning.

Contribution

It provides a comprehensive analysis of the state of human activity understanding, identifying key factors and future research directions to enhance performance.

Findings

01

Current datasets allow effective benchmarking despite activity ambiguity.

02

Combining object, pose, and temporal reasoning improves accuracy.

03

Future progress depends on integrating multiple information types.

Abstract

What is the right way to reason about human activities? What directions forward are most promising? In this work, we analyze the current state of human activity understanding in videos. The goal of this paper is to examine datasets, evaluation metrics, algorithms, and potential future directions. We look at the qualitative attributes that define activities such as pose variability, brevity, and density. The experiments consider multiple state-of-the-art algorithms and multiple datasets. The results demonstrate that while there is inherent ambiguity in the temporal extent of activities, current datasets still permit effective benchmarking. We discover that fine-grained understanding of objects and pose when combined with temporal reasoning is likely to yield substantial improvements in algorithmic accuracy. We present the many kinds of information that will be needed to achieve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gsig/actions-for-actions
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.