Multimodal Deep Learning Framework for Image Popularity Prediction on   Social Media

Fatma S. Abousaleh; Wen-Huang Cheng; Neng-Hao Yu; and Yu Tsao

arXiv:2105.08809·cs.CV·May 20, 2021

Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media

Fatma S. Abousaleh, Wen-Huang Cheng, Neng-Hao Yu, and Yu Tsao

PDF

TL;DR

This paper introduces VSCNN, a deep learning model that combines visual and social features to accurately predict image popularity on social media, outperforming existing methods.

Contribution

The study proposes a novel multimodal deep learning framework, VSCNN, integrating visual and social data for improved image popularity prediction.

Findings

01

VSCNN outperforms state-of-the-art models in predicting image popularity.

02

The model achieves over 14% improvement in mean squared error.

03

Extensive experiments on 432K images validate the effectiveness of the approach.

Abstract

Billions of photos are uploaded to the web daily through various types of social networks. Some of these images receive millions of views and become popular, whereas others remain completely unnoticed. This raises the problem of predicting image popularity on social media. The popularity of an image can be affected by several factors, such as visual content, aesthetic quality, user, post metadata, and time. Thus, considering all these factors is essential for accurately predicting image popularity. In addition, the efficiency of the predictive model also plays a crucial role. In this study, motivated by multimodal learning, which uses information from various modalities, and the current success of convolutional neural networks (CNNs) in various fields, we propose a deep learning model, called visual-social convolutional neural network (VSCNN), which predicts the popularity of a posted…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.