Crowdsourced Truth Discovery in the Presence of Hierarchies for   Knowledge Fusion

Woohwan Jung; Younghoon Kim; Kyuseok Shim

arXiv:1904.10217·cs.DB·April 24, 2019·1 cites

Crowdsourced Truth Discovery in the Presence of Hierarchies for Knowledge Fusion

Woohwan Jung, Younghoon Kim, Kyuseok Shim

PDF

Open Access

TL;DR

This paper introduces a probabilistic model for truth discovery that accounts for hierarchical structures in data and leverages crowdsourcing to improve accuracy in knowledge fusion from unstructured sources.

Contribution

It presents a novel hierarchical-aware truth discovery model combined with a crowdsourcing task assignment algorithm, enhancing accuracy in knowledge fusion tasks.

Findings

01

Effective truth inference with hierarchical data structures

02

Crowdsourcing improves the accuracy of unstructured data claims

03

Proposed algorithms outperform baseline methods in real datasets

Abstract

Existing works for truth discovery in categorical data usually assume that claimed values are mutually exclusive and only one among them is correct. However, many claimed values are not mutually exclusive even for functional predicates due to their hierarchical structures. Thus, we need to consider the hierarchical structure to effectively estimate the trustworthiness of the sources and infer the truths. We propose a probabilistic model to utilize the hierarchical structures and an inference algorithm to find the truths. In addition, in the knowledge fusion, the step of automatically extracting information from unstructured data (e.g., text) generates a lot of false claims. To take advantages of the human cognitive abilities in understanding unstructured data, we utilize crowdsourcing to refine the result of the truth discovery. We propose a task assignment algorithm to maximize the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing · Data Quality and Management · Data Stream Mining Techniques