Charades-Ego: A Large-Scale Dataset of Paired Third and First Person   Videos

Gunnar A. Sigurdsson; Abhinav Gupta; Cordelia Schmid; Ali Farhadi,; Karteek Alahari

arXiv:1804.09626·cs.CV·May 1, 2018·97 cites

Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos

Gunnar A. Sigurdsson, Abhinav Gupta, Cordelia Schmid, Ali Farhadi,, Karteek Alahari

PDF

Open Access

TL;DR

Charades-Ego is a large, diverse dataset linking first and third-person videos with detailed annotations, enabling advancements in egocentric video understanding and cross-modal tasks.

Contribution

It introduces a comprehensive egocentric video dataset with extensive annotations, expanding resources for egocentric and cross-modal video research.

Findings

01

Largest egocentric dataset with 68,536 activity instances

02

Includes temporal annotations and textual descriptions

03

Facilitates research in classification, localization, and captioning

Abstract

In Actor and Observer we introduced a dataset linking the first and third-person video understanding domains, the Charades-Ego Dataset. In this paper we describe the egocentric aspect of the dataset and present annotations for Charades-Ego with 68,536 activity instances in 68.8 hours of first and third-person video, making it one of the largest and most diverse egocentric datasets available. Charades-Ego furthermore shares activity classes, scripts, and methodology with the Charades dataset, that consist of additional 82.3 hours of third-person video with 66,500 activity instances. Charades-Ego has temporal annotations and textual descriptions, making it suitable for egocentric video classification, localization, captioning, and new tasks utilizing the cross-modal nature of the data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Video Analysis and Summarization