TL;DR
This paper analyzes the use of negations in image descriptions, categorizes their functions, and discusses requirements for systems to generate such negations, highlighting the importance of subjective language in image captioning.
Contribution
It provides a qualitative categorization of negation use in image descriptions and proposes system requirements for generating negations.
Findings
Negations are used in various contextual ways in image descriptions.
A categorization scheme for negation use was developed.
Manual annotation of negations achieved an agreement score of K=0.67.
Abstract
We provide a qualitative analysis of the descriptions containing negations (no, not, n't, nobody, etc) in the Flickr30K corpus, and a categorization of negation uses. Based on this analysis, we provide a set of requirements that an image description system should have in order to generate negation sentences. As a pilot experiment, we used our categorization to manually annotate sentences containing negations in the Flickr30K corpus, with an agreement score of K=0.67. With this paper, we hope to open up a broader discussion of subjective language in image descriptions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
