# Kernel Mean Embedding of Instance-wise Predictions in Multiple Instance   Regression

**Authors:** Thomas Uriot

arXiv: 1904.10583 · 2019-08-20

## TL;DR

This paper introduces an improved multiple instance regression method that uses kernel mean embeddings of predicted label distributions, leading to better performance than previous approaches on real-world datasets.

## Contribution

The paper extends the instance-MIR algorithm by incorporating kernel mean embeddings to better capture label distribution information within bags.

## Key findings

- Outperforms baseline instance-MIR on five datasets
- Achieves state-of-the-art results on two datasets
- Provides a more expressive representation of bag label distributions

## Abstract

In this paper, we propose an extension to an existing algorithm (instance-MIR) which tackles the multiple instance regression (MIR) problem, also known as distribution regression. The MIR setting arises when the data is a collection of bags, where each bag consists of several instances which correspond to the same and unique real-valued label. The goal of a MIR algorithm is to find a mapping from the instances of an unseen bag to its target value. The instance-MIR algorithm treats all the instances separately and maps each instance to a label. The final bag label is then taken as the mean or the median of the predictions for that given bag. While it is conceptually simple, taking a single statistic to summarize the distribution of the labels in each bag is a limitation. In spite of this performance bottleneck, the instance-MIR algorithm has been shown to be competitive when compared to the current state-of-the-art methods. We address the aforementioned issue by computing the kernel mean embeddings of the distributions of the predicted labels, for each bag, and learn a regressor from these embeddings to the bag label. We test our algorithm (instance-kme-MIR) on five real world datasets and obtain better results than the baseline instance-MIR across all the datasets, while achieving state-of-the-art results on two of the datasets.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.10583/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/1904.10583/full.md

## References

24 references — full list in the complete paper: https://tomesphere.com/paper/1904.10583/full.md

---
Source: https://tomesphere.com/paper/1904.10583