White Paper: Challenges and Considerations for the Creation of a Large Labelled Repository of Online Videos with Questionable Content
Thamar Solorio, Mahsa Shafaei, Christos Smailis, Mona Diab, Theodore Giannakopoulos, Heng Ji, Yang Liu, Rada Mihalcea, Smaranda Muresan, Ioannis Kakadiaris

TL;DR
This white paper discusses key challenges and considerations in creating a large, annotated online video repository with questionable content, focusing on labeling, collection, annotation, distribution, and annotator safety.
Contribution
It provides a comprehensive overview of the critical factors and best practices for developing a valuable and ethically responsible video dataset for AI research.
Findings
Identifies suitable labels for questionable content
Recommends collection and annotation strategies
Highlights measures to protect annotator well-being
Abstract
This white paper summarizes discussions of the critical considerations in developing an extensive repository of online videos annotated with labels indicating questionable content. The main discussion points include: 1) the types of labels that will result in a valuable repository for the larger AI community; 2) how to design the collection and annotation process, as well as the distribution of the corpus, to maximize its potential impact; and 3) what actions we can take to reduce the risk of trauma to annotators.
Taxonomy
Topics: Video Analysis and Summarization · Digital Games and Media · Multimedia Communication and Technology
