Dialogue Enhancement and Listening Effort in Broadcast Audio: A Multimodal Evaluation
Matteo Torcoli, Thomas Robotham, Emanu\"el A. P. Habets

TL;DR
This study investigates how dialogue enhancement in broadcast audio affects listening effort by measuring pupil size and self-reported effort, demonstrating that improved audio clarity reduces listening effort and pupil dilation.
Contribution
It provides a multimodal evaluation of dialogue enhancement effects on listening effort using pupil size and self-report, including the impact of source separation techniques.
Findings
Dialogue enhancement reduces pupil size and listening effort.
Source separation further improves listening experience.
Multimodal measures correlate with subjective effort reports.
Abstract
Dialogue enhancement (DE) plays a vital role in broadcasting, enabling the personalization of the relative level between foreground speech and background music and effects. DE has been shown to improve the quality of experience, intelligibility, and self-reported listening effort (LE). A physiological indicator of LE known from audiology studies is pupil size. The relation between pupil size and LE is typically studied using artificial sentences and background noises not encountered in broadcast content. This work evaluates the effect of DE on LE in a multimodal manner that includes pupil size (tracked by a VR headset) and real-world audio excerpts from TV. Under ideal listening conditions, 28 normal-hearing participants listened to 30 audio excerpts presented in random order and processed by conditions varying the relative level between foreground and background audio. One of these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Hearing Loss and Rehabilitation · Subtitles and Audiovisual Media
