Predicting Audio Advertisement Quality
Samaneh Ebrahimi, Hossein Vahabi, Matthew Prockup, Oriol Nieto

TL;DR
This paper introduces a deep learning approach to automatically predict the quality of online audio ads based on acoustic features, aiming to enhance ad ranking and creation for better user engagement.
Contribution
It presents the first large-scale study on audio ad quality prediction using raw waveform features and a novel deep learning model outperforming traditional methods.
Findings
Deep learning model outperforms models with hand-crafted features
Acoustic features like harmony, rhythm, and timbre relate to ad quality
Proposed proxy metric LCR effectively measures ad engagement
Abstract
Online audio advertising is a particular form of advertising used abundantly in online music streaming services. In these platforms, which tend to host tens of thousands of unique audio advertisements (ads), providing high quality ads ensures a better user experience and results in longer user engagement. Therefore, the automatic assessment of these ads is an important step toward audio ads ranking and better audio ads creation. In this paper we propose one way to measure the quality of the audio ads using a proxy metric called Long Click Rate (LCR), which is defined by the amount of time a user engages with the follow-up display ad (that is shown while the audio ad is playing) divided by the impressions. We later focus on predicting the audio ad quality using only acoustic features such as harmony, rhythm, and timbre of the audio, extracted from the raw waveform. We discuss how the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
