Human Action Co-occurrence in Lifestyle Vlogs using Graph Link   Prediction

Oana Ignat; Santiago Castro; Weiji Li; Rada Mihalcea

arXiv:2309.06219·cs.CV·June 21, 2024

Human Action Co-occurrence in Lifestyle Vlogs using Graph Link Prediction

Oana Ignat, Santiago Castro, Weiji Li, Rada Mihalcea

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new dataset and models for automatically identifying co-occurring human actions in lifestyle videos, leveraging graph link prediction techniques to improve understanding of action relationships.

Contribution

The paper presents the ACE dataset and graph-based models for action co-occurrence prediction, advancing the understanding of action relations in videos.

Findings

01

Graphs effectively capture relations between human actions.

02

Graph representations improve co-occurrence prediction accuracy.

03

The ACE dataset enables further research in action understanding.

Abstract

We introduce the task of automatic human action co-occurrence identification, i.e., determine whether two human actions can co-occur in the same interval of time. We create and make publicly available the ACE (Action Co-occurrencE) dataset, consisting of a large graph of ~12k co-occurring pairs of visual actions and their corresponding video clips. We describe graph link prediction models that leverage visual and textual information to automatically infer if two actions are co-occurring. We show that graphs are particularly well suited to capture relations between human actions, and the learned graph representations are effective for our task and capture novel and relevant information across different data domains. The ACE dataset and the code introduced in this paper are publicly available at https://github.com/MichiganNLP/vlog_action_co-occurrence.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

michigannlp/vlog_action_co-occurrence
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Human Pose and Action Recognition · Artificial Intelligence in Healthcare