About Time: Advances, Challenges, and Outlooks of Action Understanding

Alexandros Stergiou; Ronald Poppe

arXiv:2411.15106·cs.CV·May 7, 2025

About Time: Advances, Challenges, and Outlooks of Action Understanding

Alexandros Stergiou, Ronald Poppe

PDF

Open Access

TL;DR

This survey reviews recent progress in video action understanding, highlighting advances, challenges, datasets, and future directions across recognition, prediction, and forecasting tasks in uni- and multi-modal settings.

Contribution

It provides a comprehensive overview of recent developments, challenges, and datasets in action understanding, emphasizing temporal scopes and future research directions.

Findings

01

Significant performance improvements due to larger datasets and computation.

02

Diverse applications including scene description, segmentation, synthesis, and context prediction.

03

Identification of key challenges and future research directions in the field.

Abstract

We have witnessed impressive advances in video action understanding. Increased dataset sizes, variability, and computation availability have enabled leaps in performance and task diversification. Current systems can provide coarse- and fine-grained descriptions of video scenes, extract segments corresponding to queries, synthesize unobserved parts of videos, and predict context across multiple modalities. This survey comprehensively reviews advances in uni- and multi-modal action understanding across a range of tasks. We focus on prevalent challenges, overview widely adopted datasets, and survey seminal works with an emphasis on recent advances. We broadly distinguish between three temporal scopes: (1) recognition tasks of actions observed in full, (2) prediction tasks for ongoing partially observed actions, and (3) forecasting tasks for subsequent unobserved action(s). This division…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making

MethodsFocus