Do You See What I See? Capabilities and Limits of Automated Multimedia   Content Analysis

Carey Shenkman; Dhanaraj Thakur; Emma Llans\'o

arXiv:2201.11105·cs.MM·January 27, 2022·26 cites

Do You See What I See? Capabilities and Limits of Automated Multimedia Content Analysis

Carey Shenkman, Dhanaraj Thakur, Emma Llans\'o

PDF

Open Access

TL;DR

This paper reviews the capabilities and limitations of automated multimedia content analysis tools, emphasizing their roles in content moderation, policy debates, and the risks of large-scale deployment without understanding their constraints.

Contribution

It provides a comprehensive overview of matching and predictive models used in multimedia analysis, highlighting potential risks and limitations for policy and practical applications.

Findings

01

Matching models include cryptographic and perceptual hashing.

02

Predictive models involve machine learning techniques like computer vision and audition.

03

Automated tools have significant limitations and risks when used at scale.

Abstract

The ever-increasing amount of user-generated content online has led, in recent years, to an expansion in research and investment in automated content analysis tools. Scrutiny of automated content analysis has accelerated during the COVID-19 pandemic, as social networking services have placed a greater reliance on these tools due to concerns about health risks to their moderation staff from in-person work. At the same time, there are important policy debates around the world about how to improve content moderation while protecting free expression and privacy. In order to advance these debates, we need to understand the potential role of automated content analysis tools. This paper explains the capabilities and limitations of tools for analyzing online multimedia content and highlights the potential risks of using these tools at scale without accounting for their limitations. It focuses…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Sentiment Analysis and Opinion Mining · Misinformation and Its Impacts