# Deep Archetypal Analysis

**Authors:** Sebastian Mathias Keller, Maxim Samarin, Mario Wieser, Volker Roth

arXiv: 1901.10799 · 2020-01-27

## TL;DR

Deep Archetypal Analysis extends traditional archetypal analysis with deep learning to generate interpretable latent representations, handle side information, and explore data in biological, chemical, and image domains.

## Contribution

It introduces a deep learning extension of archetypal analysis that incorporates side information and enables data-driven, interpretable representations across various datasets.

## Key findings

- Identifies archetypal faces in CelebA dataset.
- Generates new faces by combining archetypes.
- Facilitates exploration of chemical space for molecular design.

## Abstract

"Deep Archetypal Analysis" generates latent representations of high-dimensional datasets in terms of fractions of intuitively understandable basic entities called archetypes. The proposed method is an extension of linear "Archetypal Analysis" (AA), an unsupervised method to represent multivariate data points as sparse convex combinations of extremal elements of the dataset. Unlike the original formulation of AA, "Deep AA" can also handle side information and provides the ability for data-driven representation learning which reduces the dependence on expert knowledge. Our method is motivated by studies of evolutionary trade-offs in biology where archetypes are species highly adapted to a single task. Along these lines, we demonstrate that "Deep AA" also lends itself to the supervised exploration of chemical space, marking a distinct starting point for de novo molecular design. In the unsupervised setting we show how "Deep AA" is used on CelebA to identify archetypal faces. These can then be superimposed in order to generate new faces which inherit dominant traits of the archetypes they are based on.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.10799/full.md

## Figures

19 figures with captions in the complete paper: https://tomesphere.com/paper/1901.10799/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/1901.10799/full.md

---
Source: https://tomesphere.com/paper/1901.10799