Quantifying the visual concreteness of words and topics in multimodal   datasets

Jack Hessel; David Mimno; Lillian Lee

arXiv:1804.06786·cs.CL·May 25, 2018

Quantifying the visual concreteness of words and topics in multimodal datasets

Jack Hessel, David Mimno, Lillian Lee

PDF

1 Repo

TL;DR

This paper introduces an algorithm to measure the visual concreteness of words and topics in multimodal datasets, revealing that concrete concepts are generally easier for algorithms to learn and providing insights for future research.

Contribution

The authors present a novel method for automatically quantifying visual concreteness in multimodal datasets, enabling better understanding and analysis of concept learnability.

Findings

01

Concrete concepts are easier for algorithms to learn.

02

Different algorithms exhibit similar failure cases.

03

The relationship between concreteness and performance varies across datasets.

Abstract

Multimodal machine learning algorithms aim to learn visual-textual correspondences. Previous work suggests that concepts with concrete visual manifestations may be easier to learn than concepts with abstract ones. We give an algorithm for automatically computing the visual concreteness of words and topics within multimodal datasets. We apply the approach in four settings, ranging from image captions to images/text scraped from historical books. In addition to enabling explorations of concepts in multimodal datasets, our concreteness scores predict the capacity of machine learning algorithms to learn textual/visual relationships. We find that 1) concrete concepts are indeed easier to learn; 2) the large number of algorithms we consider have similar failure cases; 3) the precise positive relationship between concreteness and performance varies between datasets. We conclude with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

victorssilva/concreteness
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.