Learning about individuals from group statistics

Hendrik Kuck; Nando de Freitas

arXiv:1207.1393·cs.LG·July 9, 2012·35 cites

Learning about individuals from group statistics

Hendrik Kuck, Nando de Freitas

PDF

Open Access

TL;DR

This paper introduces a probabilistic approach to infer individual labels from group-level statistics, advancing multiple-instance learning by estimating individual classifications from aggregate data.

Contribution

It presents a novel probabilistic model and an efficient MCMC training algorithm for learning individual labels from group statistics, improving upon existing methods.

Findings

01

Effective on synthetic data

02

Demonstrated on real-world object recognition data

03

Outperforms baseline methods

Abstract

We propose a new problem formulation which is similar to, but more informative than, the binary multiple-instance learning problem. In this setting, we are given groups of instances (described by feature vectors) along with estimates of the fraction of positively-labeled instances per group. The task is to learn an instance level classifier from this information. That is, we are trying to estimate the unknown binary labels of individuals from knowledge of group statistics. We propose a principled probabilistic model to solve this problem that accounts for uncertainty in the parameters and in the unknown individual labels. This model is trained with an efficient MCMC algorithm. Its performance is demonstrated on both synthetic and real-world data arising in general object recognition.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Digital Imaging for Blood Diseases · Machine Learning and Data Classification