Robust Subjective Visual Property Prediction from Crowdsourced Pairwise Labels
Yanwei Fu, Timothy M. Hospedales, Tao Xiang, Jiechao Xiong, Shaogang, Gong, Yizhou Wang, and Yuan Yao

TL;DR
This paper introduces a robust learning-to-rank framework for predicting subjective visual properties from crowdsourced pairwise labels, effectively identifying outliers and enabling accurate predictions with sparse data.
Contribution
It proposes a unified approach that jointly detects annotation outliers and learns to rank, improving robustness over existing majority voting methods.
Findings
Outperforms state-of-the-art methods on benchmark datasets.
Effectively detects annotation outliers in crowdsourced data.
Achieves accurate predictions with fewer annotations.
Abstract
The problem of estimating subjective visual properties from image and video has attracted increasing interest. A subjective visual property is useful either on its own (e.g. image and video interestingness) or as an intermediate representation for visual recognition (e.g. a relative attribute). Due to its ambiguous nature, annotating the value of a subjective visual property for learning a prediction model is challenging. To make the annotation more reliable, recent studies employ crowdsourcing tools to collect pairwise comparison labels because human annotators are much better at ranking two images/videos (e.g. which one is more interesting) than giving an absolute value to each of them separately. However, using crowdsourced data also introduces outliers. Existing methods rely on majority voting to prune the annotation outliers/errors. They thus require large amount of pairwise labels…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
