Separating Self-Expression and Visual Content in Hashtag Supervision

Andreas Veit; Maximilian Nickel; Serge Belongie; Laurens van der; Maaten

arXiv:1711.09825·cs.CV·November 28, 2017

Separating Self-Expression and Visual Content in Hashtag Supervision

Andreas Veit, Maximilian Nickel, Serge Belongie, Laurens van der, Maaten

PDF

1 Repo

TL;DR

This paper introduces a joint modeling approach that separates self-expressive hashtags from visual content, improving image tagging and retrieval by accounting for user-specific hashtag usage.

Contribution

It proposes a novel joint distribution model of images, hashtags, and users to handle hashtag ambiguity and subjectivity in vision tasks.

Findings

01

Enhanced image tagging accuracy

02

Improved user-conditional retrieval

03

Effective handling of hashtag ambiguity

Abstract

The variety, abundance, and structured nature of hashtags make them an interesting data source for training vision models. For instance, hashtags have the potential to significantly reduce the problem of manual supervision and annotation when learning vision models for a large number of concepts. However, a key challenge when learning from hashtags is that they are inherently subjective because they are provided by users as a form of self-expression. As a consequence, hashtags may have synonyms (different hashtags referring to the same visual content) and may be ambiguous (the same hashtag referring to different visual content). These challenges limit the effectiveness of approaches that simply treat hashtags as image-label pairs. This paper presents an approach that extends upon modeling simple image-label pairs by modeling the joint distribution of images, hashtags, and users. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

weiyinwei/GCN_PHR
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.