Self-supervised Multi-actor Social Activity Understanding in Streaming   Videos

Shubham Trehan; Sathyanarayanan N. Aakur

arXiv:2406.14472·cs.CV·June 21, 2024

Self-supervised Multi-actor Social Activity Understanding in Streaming Videos

Shubham Trehan, Sathyanarayanan N. Aakur

PDF

Open Access

TL;DR

This paper introduces a self-supervised multi-actor social activity recognition method for streaming videos that models social interactions with minimal labeled data, outperforming traditional approaches in real-world scenarios.

Contribution

It proposes a novel self-supervised framework using a visual-semantic graph for modeling social interactions in streaming videos, reducing reliance on annotated data.

Findings

01

Achieves competitive results on standard group activity benchmarks.

02

Demonstrates strong generalization across multiple action localization datasets.

03

Effectively models social interactions with minimal supervision.

Abstract

This work addresses the problem of Social Activity Recognition (SAR), a critical component in real-world tasks like surveillance and assistive robotics. Unlike traditional event understanding approaches, SAR necessitates modeling individual actors' appearance and motions and contextualizing them within their social interactions. Traditional action localization methods fall short due to their single-actor, single-action assumption. Previous SAR research has relied heavily on densely annotated data, but privacy concerns limit their applicability in real-world settings. In this work, we propose a self-supervised approach based on multi-actor predictive learning for SAR in streaming videos. Using a visual-semantic graph structure, we model social interactions, enabling relational reasoning for robust performance with minimal labeled data. The proposed framework achieves competitive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Analysis and Summarization