Exploring Visual Engagement Signals for Representation Learning

Menglin Jia; Zuxuan Wu; Austin Reiter; Claire Cardie; Serge Belongie,; Ser-Nam Lim

arXiv:2104.07767·cs.CV·August 17, 2021

Exploring Visual Engagement Signals for Representation Learning

Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie,, Ser-Nam Lim

PDF

Open Access 1 Repo

TL;DR

This paper introduces VisE, a weakly supervised learning method that uses social media engagement signals as supervisory cues to improve representation learning for various computer vision tasks.

Contribution

It presents a novel approach to leverage social engagement signals as pseudo labels for training visual models, bridging the gap between visual data and social interactions.

Findings

01

VisE improves performance on emotion recognition tasks.

02

Models trained with VisE outperform traditional methods on political bias detection.

03

The approach demonstrates versatility across multiple classification tasks.

Abstract

Visual engagement in social media platforms comprises interactions with photo posts including comments, shares, and likes. In this paper, we leverage such visual engagement clues as supervisory signals for representation learning. However, learning from engagement signals is non-trivial as it is not clear how to bridge the gap between low-level visual information and high-level social interactions. We present VisE, a weakly supervised learning approach, which maps social images to pseudo labels derived by clustered engagement signals. We then study how models trained in this way benefit subjective downstream computer vision tasks such as emotion recognition or political bias detection. Through extensive studies, we empirically demonstrate the effectiveness of VisE across a diverse set of classification tasks beyond the scope of conventional recognition.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

KMnP/vise
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Sentiment Analysis and Opinion Mining