Discovering Attribute Shades of Meaning with the Crowd
Adriana Kovashka, Kristen Grauman

TL;DR
This paper introduces a method to discover different shades of meaning for visual attributes using crowdsourced labels, enabling more accurate and personalized attribute recognition and image retrieval.
Contribution
It proposes a novel approach to uncover latent interpretative shades of attributes from crowd labels, improving visual attribute modeling and search.
Findings
Latent attribute shades improve classification accuracy.
Shades enable personalized and robust image retrieval.
Method applicable to zero-shot learning and photo organization.
Abstract
To learn semantic attributes, existing methods typically train one discriminative model for each word in a vocabulary of nameable properties. However, this "one model per word" assumption is problematic: while a word might have a precise linguistic definition, it need not have a precise visual definition. We propose to discover shades of attribute meaning. Given an attribute name, we use crowdsourced image labels to discover the latent factors underlying how different annotators perceive the named concept. We show that structure in those latent factors helps reveal shades, that is, interpretations for the attribute shared by some group of annotators. Using these shades, we train classifiers to capture the primary (often subtle) variants of the attribute. The resulting models are both semantic and visually precise. By catering to users' interpretations, they improve attribute prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
