# Generating the support with extreme value losses

**Authors:** Nicholas Guttenberg

arXiv: 1902.02940 · 2019-02-11

## TL;DR

This paper introduces a novel loss function based on extreme values to improve generative modeling by capturing the support of target distributions, especially effective in low-dimensional, complex distribution scenarios.

## Contribution

The paper proposes a simple method using extreme value losses to enable models to learn the support of target distributions and generate diverse outputs.

## Key findings

- Models trained with extreme value loss capture the support of target distributions.
- The method effectively reproduces complex, low-dimensional distributions.
- Less effective in high-dimensional spaces due to sample complexity.

## Abstract

When optimizing against the mean loss over a distribution of predictions in the context of a regression task, then even if there is a distribution of targets the optimal prediction distribution is always a delta function at a single value. Methods of constructing generative models need to overcome this tendency. We consider a simple method of summarizing the prediction error, such that the optimal strategy corresponds to outputting a distribution of predictions with a support that matches the support of the distribution of targets --- optimizing against the minimal value of the loss given a set of samples from the prediction distribution, rather than the mean. We show that models trained against this loss learn to capture the support of the target distribution and, when combined with an auxiliary classifier-like prediction task, can be projected via rejection sampling to reproduce the full distribution of targets. The resulting method works well compared to other generative modeling approaches particularly in low dimensional spaces with highly non-trivial distributions, due to mode collapse solutions being globally suboptimal with respect to the extreme value loss. However, the method is less suited to high-dimensional spaces such as images due to the scaling of the number of samples needed in order to accurately estimate the extreme value loss when the dimension of the data manifold becomes large.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.02940/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1902.02940/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/1902.02940/full.md

---
Source: https://tomesphere.com/paper/1902.02940