# One-element Batch Training by Moving Window

**Authors:** Przemys{\l}aw Spurek, Szymon Knop, Jacek Tabor, Igor Podolak, Bartosz, W\'ojcik

arXiv: 1905.12947 · 2019-06-03

## TL;DR

This paper introduces a novel training method for deep generative models that enables effective learning using only one-element mini-batches by splitting data into historical and current parts, reducing memory usage and allowing higher resolution training.

## Contribution

The paper presents a generic approach to train models with one-element batches by splitting data in latent space, facilitating higher resolution training and broadening applicability.

## Key findings

- Enables training with one-element mini-batches.
- Reduces memory requirements for training.
- Allows higher resolution image training.

## Abstract

Several deep models, esp. the generative, compare the samples from two distributions (e.g. WAE like AutoEncoder models, set-processing deep networks, etc) in their cost functions. Using all these methods one cannot train the model directly taking small size (in extreme -- one element) batches, due to the fact that samples are to be compared.   We propose a generic approach to training such models using one-element mini-batches. The idea is based on splitting the batch in latent into parts: previous, i.e. historical, elements used for latent space distribution matching and the current ones, used both for latent distribution computation and the minimization process. Due to the smaller memory requirements, this allows to train networks on higher resolution images then in the classical approach.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.12947/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1905.12947/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/1905.12947/full.md

---
Source: https://tomesphere.com/paper/1905.12947