Object State Change Classification in Egocentric Videos using the   Divided Space-Time Attention Mechanism

Md Mohaiminul Islam; Gedas Bertasius

arXiv:2207.11814·cs.CV·January 5, 2023·1 cites

Object State Change Classification in Egocentric Videos using the Divided Space-Time Attention Mechanism

Md Mohaiminul Islam, Gedas Bertasius

PDF

Open Access 1 Repo

TL;DR

This paper introduces a transformer-based model utilizing Divided Space-Time Attention for classifying object state changes in egocentric videos, achieving high performance in the Ego4D challenge and emphasizing the importance of temporal modeling.

Contribution

The paper presents a novel application of Divided Space-Time Attention in a transformer model for egocentric object state change classification, with comprehensive ablation studies.

Findings

01

Achieved second-best performance in the Ego4D challenge.

02

Temporal modeling is crucial for accurate object state change classification.

03

Visualization of model predictions demonstrates interpretability.

Abstract

This report describes our submission called "TarHeels" for the Ego4D: Object State Change Classification Challenge. We use a transformer-based video recognition model and leverage the Divided Space-Time Attention mechanism for classifying object state change in egocentric videos. Our submission achieves the second-best performance in the challenge. Furthermore, we perform an ablation study to show that identifying object state change in egocentric videos requires temporal modeling ability. Lastly, we present several positive and negative examples to visualize our model's predictions. The code is publicly available at: https://github.com/md-mohaiminul/ObjectStateChange

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

md-mohaiminul/objectstatechange
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health Research Topics