Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media
Fatma S. Abousaleh, Wen-Huang Cheng, Neng-Hao Yu, and Yu Tsao

TL;DR
This paper introduces VSCNN, a deep learning model that combines visual and social features to accurately predict image popularity on social media, outperforming existing methods.
Contribution
The study proposes a novel multimodal deep learning framework, VSCNN, integrating visual and social data for improved image popularity prediction.
Findings
VSCNN outperforms state-of-the-art models in predicting image popularity.
The model achieves over 14% improvement in mean squared error.
Extensive experiments on 432K images validate the effectiveness of the approach.
Abstract
Billions of photos are uploaded to the web daily through various types of social networks. Some of these images receive millions of views and become popular, whereas others remain completely unnoticed. This raises the problem of predicting image popularity on social media. The popularity of an image can be affected by several factors, such as visual content, aesthetic quality, user, post metadata, and time. Thus, considering all these factors is essential for accurately predicting image popularity. In addition, the efficiency of the predictive model also plays a crucial role. In this study, motivated by multimodal learning, which uses information from various modalities, and the current success of convolutional neural networks (CNNs) in various fields, we propose a deep learning model, called visual-social convolutional neural network (VSCNN), which predicts the popularity of a posted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
