Automatic Understanding of Image and Video Advertisements
Zaeem Hussain, Mingda Zhang, Xiaozhong Zhang, Keren Ye, Christopher, Thomas, Zuha Agha, Nathan Ong, Adriana Kovashka

TL;DR
This paper introduces the novel problem of automatic advertisement understanding, creating large annotated datasets for images and videos, and analyzing persuasive strategies with baseline classification results.
Contribution
It presents the first datasets with rich annotations for ad understanding and analyzes persuasive strategies, enabling future research in automatic ad comprehension.
Findings
Created datasets with 64,832 image ads and 3,477 video ads with detailed annotations.
Analyzed common persuasive strategies used in advertisements.
Provided baseline classification results for predicting ad messages and strategies.
Abstract
There is more to images than their objective physical content: for example, advertisements are created to persuade a viewer to take a certain action. We propose the novel problem of automatic advertisement understanding. To enable research on this problem, we create two datasets: an image dataset of 64,832 image ads, and a video dataset of 3,477 ads. Our data contains rich annotations encompassing the topic and sentiment of the ads, questions and answers describing what actions the viewer is prompted to take and the reasoning that the ad presents to persuade the viewer ("What should I do according to this ad, and why should I do it?"), and symbolic references ads make (e.g. a dove symbolizes peace). We also analyze the most common persuasive strategies ads use, and the capabilities that computer vision systems should have to understand these strategies. We present baseline…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
