Affective Computing for Large-Scale Heterogeneous Multimedia Data: A Survey
Sicheng Zhao, Shangfei Wang, Mohammad Soleymani, Dhiraj Joshi, Qiang, Ji

TL;DR
This survey reviews current affective computing techniques for large-scale heterogeneous multimedia data, covering emotion models, datasets, methods, and future challenges in understanding human emotions through multimedia analysis.
Contribution
It provides a comprehensive overview of state-of-the-art AC technologies for multimedia data, including both traditional and deep learning approaches, and discusses future research directions.
Findings
Comparison of handcrafted and deep learning methods
Summary of datasets for multimedia affective computing
Discussion of challenges and future directions
Abstract
The wide popularity of digital photography and social networks has generated a rapidly growing volume of multimedia data (i.e., image, music, and video), resulting in a great demand for managing, retrieving, and understanding these data. Affective computing (AC) of these data can help to understand human behaviors and enable wide applications. In this article, we survey the state-of-the-art AC technologies comprehensively for large-scale heterogeneous multimedia data. We begin this survey by introducing the typical emotion representation models from psychology that are widely employed in AC. We briefly describe the available datasets for evaluating AC algorithms. We then summarize and compare the representative methods on AC of different multimedia types, i.e., images, music, videos, and multimodal data, with the focus on both handcrafted features-based methods and deep learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
