Do You See What I See? Capabilities and Limits of Automated Multimedia Content Analysis
Carey Shenkman, Dhanaraj Thakur, Emma Llans\'o

TL;DR
This paper reviews the capabilities and limitations of automated multimedia content analysis tools, emphasizing their roles in content moderation, policy debates, and the risks of large-scale deployment without understanding their constraints.
Contribution
It provides a comprehensive overview of matching and predictive models used in multimedia analysis, highlighting potential risks and limitations for policy and practical applications.
Findings
Matching models include cryptographic and perceptual hashing.
Predictive models involve machine learning techniques like computer vision and audition.
Automated tools have significant limitations and risks when used at scale.
Abstract
The ever-increasing amount of user-generated content online has led, in recent years, to an expansion in research and investment in automated content analysis tools. Scrutiny of automated content analysis has accelerated during the COVID-19 pandemic, as social networking services have placed a greater reliance on these tools due to concerns about health risks to their moderation staff from in-person work. At the same time, there are important policy debates around the world about how to improve content moderation while protecting free expression and privacy. In order to advance these debates, we need to understand the potential role of automated content analysis tools. This paper explains the capabilities and limitations of tools for analyzing online multimedia content and highlights the potential risks of using these tools at scale without accounting for their limitations. It focuses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Sentiment Analysis and Opinion Mining · Misinformation and Its Impacts
