# Closed-form Expressions for Maximum Mean Discrepancy with Applications to Wasserstein Auto-Encoders

**Authors:** Raif M. Rustamov

arXiv: 1901.03227 · 2020-06-03

## TL;DR

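This paper derives closed-form formulas for estimating the Gaussian kernel MMD between a given distribution and the standard multivariate normal, links them to the BHEP statistic, and introduces a standardized MMD penalty and code-layer batch normalization to improve Wasserstein Auto-Encoder training and interpretability.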

## Contribution

It derives closed-form expressions for estimating the Gaussian kernel MMD between a given distribution and the standard multivariate normal distribution, and applies them to make WAE training easier and MMD values more interpretable.

## Key findings

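- The analytic (closed-form) MMD improves over the commonly used stochastic approximation of the MMD in experiments on synthetic and real data.
- Using the standardized MMD as the WAE penalty makes MMD values easier to interpret and more compatible across different hyperparameter settings.
- A version of batch normalization at the code layer makes kernel width selection easier, reduces the training effort, and prevents outliers in the aggregate code distribution.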
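
To make the last finding concrete, here is a minimal sketch of code-layer normalization. It assumes the simplest reading, per-dimension standardization across the batch with no learned scale or shift; the paper describes "a version of batch normalization", so its exact variant may differ. The function name `normalize_codes` and the `eps` parameter are illustrative, not taken from the paper.

```python
import numpy as np


def normalize_codes(z, eps=1e-5):
    """Standardize each code dimension across the batch.

    Assumption: plain standardization without learned affine parameters.
    z has shape (batch_size, code_dim).
    """
    z = np.asarray(z, dtype=float)
    mean = z.mean(axis=0, keepdims=True)  # per-dimension batch mean
    var = z.var(axis=0, keepdims=True)    # per-dimension batch variance
    return (z - mean) / np.sqrt(var + eps)


# Codes whose dimensions live on very different scales.
rng = np.random.default_rng(0)
raw_codes = rng.standard_normal((256, 4)) * np.array([5.0, 50.0, 10.0, 20.0]) + 3.0
codes = normalize_codes(raw_codes)
print(codes.mean(axis=0), codes.std(axis=0))
```

After this step every code dimension has roughly zero mean and unit variance, the same scale as the standard normal target, which is one plausible reason a single kernel width becomes easier to choose.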

## Abstract

The Maximum Mean Discrepancy (MMD) has found numerous applications in statistics and machine learning, most recently as a penalty in the Wasserstein Auto-Encoder (WAE). In this paper we compute closed-form expressions for estimating the Gaussian kernel based MMD between a given distribution and the standard multivariate normal distribution. This formula reveals a connection to the Baringhaus-Henze-Epps-Pulley (BHEP) statistic of the Henze-Zirkler test and provides further insights about the MMD. We introduce the standardized version of MMD as a penalty for the WAE training objective, allowing for a better interpretability of MMD values and more compatibility across different hyperparameter settings. Next, we propose using a version of batch normalization at the code layer; this has the benefits of making the kernel width selection easier, reducing the training effort, and preventing outliers in the aggregate code distribution. Our experiments on synthetic and real data show that the analytic formulation improves over the commonly used stochastic approximation of the MMD, and demonstrate that code normalization provides significant benefits when training WAEs.
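
As an illustration of the closed-form idea in the abstract, the sketch below estimates the squared MMD between a batch of codes and the standard multivariate normal without drawing any samples from the normal. It assumes the kernel parametrization k(x, y) = exp(-||x - y||^2 / (2γ^2)) and the unbiased (U-statistic) form of the sample term; the two terms involving the normal follow from standard Gaussian integrals. The function name and the conventions used are assumptions for this example and may not match the paper's notation.

```python
import numpy as np


def gaussian_mmd2_to_std_normal(z, gamma):
    """Estimate squared MMD between samples z and N(0, I).

    Assumes k(x, y) = exp(-||x - y||^2 / (2 * gamma^2)).
    z has shape (n, d) with n >= 2.
    """
    z = np.asarray(z, dtype=float)
    n, d = z.shape
    g2 = gamma ** 2

    # Sample-vs-sample term: average of k(z_i, z_j) over pairs with i != j.
    sq_norms = np.sum(z ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (z @ z.T)
    sq_dists = np.maximum(sq_dists, 0.0)  # guard against negative round-off
    k = np.exp(-sq_dists / (2.0 * g2))
    term_zz = (k.sum() - np.trace(k)) / (n * (n - 1))

    # Cross term: E_y k(z_i, y) for y ~ N(0, I), integrated analytically.
    cross = np.exp(-sq_norms / (2.0 * (1.0 + g2)))
    term_zy = (g2 / (1.0 + g2)) ** (d / 2) * cross.mean()

    # Normal-vs-normal term: E k(y, y') for y, y' ~ N(0, I), analytic.
    term_yy = (g2 / (2.0 + g2)) ** (d / 2)

    return term_zz - 2.0 * term_zy + term_yy


rng = np.random.default_rng(0)
codes_normal = rng.standard_normal((2000, 8))
codes_shifted = codes_normal + 1.0  # mean moved away from the origin
print(gaussian_mmd2_to_std_normal(codes_normal, gamma=2.0))
print(gaussian_mmd2_to_std_normal(codes_shifted, gamma=2.0))
```

Because only the sample-vs-sample term depends on pairs of observed codes, the estimate carries no extra noise from sampling the prior, unlike an approximation that draws a fresh batch from the normal at every step.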

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.03227/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/1901.03227/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/1901.03227/full.md

---
Source: https://tomesphere.com/paper/1901.03227