Multi-source Hierarchical Prediction Consolidation

Chenwei Zhang; Sihong Xie; Yaliang Li; Jing Gao; Wei Fan; Philip S. Yu

arXiv:1608.03344 · cs.DB · August 12, 2016

Open Access

TL;DR

This paper introduces a novel method for consolidating predictions from multiple sources within hierarchical label structures, effectively handling noisy, conflicting data in complex real-world applications like healthcare.

Contribution

It proposes a new hierarchical prediction consolidation approach with a closed-form solution, leveraging label hierarchies to improve aggregation accuracy in noisy multi-source environments.

Findings

01. Outperforms existing methods on synthetic datasets.
02. Effective in real-world healthcare data scenarios.
03. Handles noisy, conflicting multi-source predictions efficiently.

Abstract

In big data applications such as healthcare data mining, privacy concerns make it necessary to collect predictions from multiple information sources for the same instance, with raw features discarded or withheld when the predictions are aggregated. Similarly, crowd-sourced labels need to be aggregated to estimate the ground truth of the data. Because predictive models and human crowdsourcing workers are imperfect, noisy and conflicting information is ubiquitous and inevitable. Although state-of-the-art aggregation methods have been proposed to handle label spaces with flat structures, label spaces are becoming more and more complicated, and aggregation under a hierarchical label structure has become necessary but has been largely ignored. These label hierarchies can be quite informative as they are usually created by domain experts to make sense of highly complex label…

Taxonomy

Topics: Machine Learning and Data Classification · Text and Document Classification Technologies · Music and Audio Processing