White Paper: Challenges and Considerations for the Creation of a Large   Labelled Repository of Online Videos with Questionable Content

Thamar Solorio; Mahsa Shafaei; Christos Smailis; Mona Diab; Theodore; Giannakopoulos; Heng Ji; Yang Liu; Rada Mihalcea; Smaranda Muresan; Ioannis; Kakadiaris

arXiv:2101.10894·cs.CV·January 27, 2021

White Paper: Challenges and Considerations for the Creation of a Large Labelled Repository of Online Videos with Questionable Content

Thamar Solorio, Mahsa Shafaei, Christos Smailis, Mona Diab, Theodore, Giannakopoulos, Heng Ji, Yang Liu, Rada Mihalcea, Smaranda Muresan, Ioannis, Kakadiaris

PDF

Open Access

TL;DR

This white paper discusses key challenges and considerations in creating a large, annotated online video repository with questionable content, focusing on labeling, collection, annotation, distribution, and annotator safety.

Contribution

It provides a comprehensive overview of the critical factors and best practices for developing a valuable and ethically responsible video dataset for AI research.

Findings

01

Identifies suitable labels for questionable content

02

Recommends collection and annotation strategies

03

Highlights measures to protect annotator well-being

Abstract

This white paper presents a summary of the discussions regarding critical considerations to develop an extensive repository of online videos annotated with labels indicating questionable content. The main discussion points include: 1) the type of appropriate labels that will result in a valuable repository for the larger AI community; 2) how to design the collection and annotation process, as well as the distribution of the corpus to maximize its potential impact; and, 3) what actions we can take to reduce risk of trauma to annotators.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Analysis and Summarization · Digital Games and Media · Multimedia Communication and Technology