Discovering Multiple Truths with a Hybrid Model
Furong Li, Xin Luna Dong, Anno Langen, Yang Li

TL;DR
This paper introduces a Hybrid model for data integration that accurately identifies multiple truths from conflicting sources by jointly determining the number of truths and their values, balancing precision and recall.
Contribution
The paper presents a novel Hybrid model that simultaneously infers the number of truths and their values, improving accuracy in multi-truth data integration tasks.
Findings
Achieves high precision and recall in identifying multiple truths.
Effectively handles conflicting and erroneous source data.
Jointly infers the number of truths and their values.
Abstract
Many data management applications require integrating information from multiple sources. The sources may not be accurate and provide erroneous values. We thus have to identify the true values from conflicting observations made by the sources. The problem is further complicated when there may exist multiple truths (e.g., a book written by several authors). In this paper we propose a model called Hybrid that jointly makes two decisions: how many truths there are, and what they are. It considers the conflicts between values as important evidence for ruling out wrong values, while keeps the flexibility of allowing multiple truths. In this way, Hybrid is able to achieve both high precision and high recall.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Imbalanced Data Classification Techniques · Anomaly Detection Techniques and Applications
