Graphing the Future: Activity and Next Active Object Prediction using   Graph-based Activity Representations

Victoria Manousaki; Konstantinos Papoutsakis; Antonis Argyros

arXiv:2209.05194·cs.CV·September 13, 2022

Graphing the Future: Activity and Next Active Object Prediction using Graph-based Activity Representations

Victoria Manousaki, Konstantinos Papoutsakis, Antonis Argyros

PDF

Open Access

TL;DR

This paper introduces a graph-based method for predicting ongoing human-object interactions and future active objects in videos, achieving high accuracy without modeling detailed motion or contact points.

Contribution

It proposes a novel graph matching approach using Graph Edit distance for classifying interactions and forecasting next active objects in video sequences.

Findings

01

High accuracy in action prediction on MSR Daily Activities dataset

02

Effective forecasting of next active objects on CAD120 dataset

03

Demonstrates the potential of graph-based methods for future interaction prediction

Abstract

We present a novel approach for the visual prediction of human-object interactions in videos. Rather than forecasting the human and object motion or the future hand-object contact points, we aim at predicting (a)the class of the on-going human-object interaction and (b) the class(es) of the next active object(s) (NAOs), i.e., the object(s) that will be involved in the interaction in the near future as well as the time the interaction will occur. Graph matching relies on the efficient Graph Edit distance (GED) method. The experimental evaluation of the proposed approach was conducted using two well-established video datasets that contain human-object interactions, namely the MSR Daily Activities and the CAD120. High prediction accuracy was obtained for both action prediction and NAO forecasting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Advanced Graph Neural Networks · Multimodal Machine Learning Applications