Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Arnav Chakravarthy, Zhiyuan Fang, Yezhou Yang

TL;DR
This paper introduces the W-Oops dataset of unintentional human action videos and proposes a weakly supervised learning method to localize goal-directed and unintentional activities, enhancing AI understanding of human goals.
Contribution
The paper creates the W-Oops dataset and develops a novel attention-based weakly supervised algorithm for localizing goal-directed and unintentional actions in videos.
Findings
Effective localization of goal-directed and unintentional activities demonstrated
Improved teleological understanding in video captioning tasks
Validation of the proposed method through extensive experiments
Abstract
In videos that contain actions performed unintentionally, agents do not achieve their desired goals. In such videos, it is challenging for computer vision systems to understand high-level concepts such as goal-directed behavior, an ability present in humans from a very early age. Inculcating this ability in artificially intelligent agents would make them better social learners by allowing them to evaluate human action under a teleological lens. To validate the ability of deep learning models to perform this task, we curate the W-Oops dataset, built upon the Oops dataset [15]. W-Oops consists of 2,100 unintentional human action videos, with 44 goal-directed and 30 unintentional video-level activity labels collected through human annotations. Due to the expensive segment annotation procedure, we propose a weakly supervised algorithm for localizing the goal-directed as well as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Anomaly Detection Techniques and Applications
