Natural representation of composite data with replicated autoencoders

Matteo Negri; Davide Bergamini; Carlo Baldassi; Riccardo Zecchina,; Christoph Feinauer

arXiv:1909.13327·q-bio.QM·October 4, 2019

Natural representation of composite data with replicated autoencoders

Matteo Negri, Davide Bergamini, Carlo Baldassi, Riccardo Zecchina,, Christoph Feinauer

PDF

Open Access

TL;DR

This paper introduces a novel unsupervised autoencoder-based method that optimizes local entropy through replicated autoencoders, enabling robust inference of basic features in composite biological data without prior knowledge.

Contribution

It presents a new approach using replicated autoencoders and local entropy optimization for better feature inference in complex data.

Findings

01

Successfully infers hidden features correlating with generative processes

02

Enhances robustness and performance over standard autoencoders

03

Applicable to synthetic and protein sequence data

Abstract

Generative processes in biology and other fields often produce data that can be regarded as resulting from a composition of basic features. Here we present an unsupervised method based on autoencoders for inferring these basic features of data. The main novelty in our approach is that the training is based on the optimization of the `local entropy' rather than the standard loss, resulting in a more robust inference, and enhancing the performance on this type of data considerably. Algorithmically, this is realized by training an interacting system of replicated autoencoders. We apply this method to synthetic and protein sequence data, and show that it is able to infer a hidden representation that correlates well with the underlying generative process, without requiring any prior knowledge.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCell Image Analysis Techniques · Generative Adversarial Networks and Image Synthesis · Fractal and DNA sequence analysis