Leveraging Transitive Relations for Crowdsourced Joins
Jiannan Wang, Guoliang Li, Tim Kraska, Michael J. Franklin, and Jianhua Feng

TL;DR
This paper introduces a hybrid approach that leverages transitive relations to minimize crowdsourcing efforts in join queries, significantly reducing costs and time while maintaining high result quality.
Contribution
It proposes a novel framework that utilizes transitive relations to optimize crowdsourced join labeling, including an optimal and heuristic labeling order, with efficient parallel algorithms.
Findings
Reduces crowdsourcing costs by leveraging transitive relations.
Achieves significant time savings with minimal quality loss.
Validated through simulations and real crowdsourcing experiments.
Abstract
The development of crowdsourced query processing systems has recently attracted a significant attention in the database community. A variety of crowdsourced queries have been investigated. In this paper, we focus on the crowdsourced join query which aims to utilize humans to find all pairs of matching objects from two collections. As a human-only solution is expensive, we adopt a hybrid human-machine approach which first uses machines to generate a candidate set of matching pairs, and then asks humans to label the pairs in the candidate set as either matching or non-matching. Given the candidate pairs, existing approaches will publish all pairs for verification to a crowdsourcing platform. However, they neglect the fact that the pairs satisfy transitive relations. As an example, if matches with , and matches with , then we can deduce that matches with …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Privacy-Preserving Technologies in Data · Data Management and Algorithms
