On the Relation Between Autoencoders and Non-negative Matrix   Factorization, and Their Application for Mutational Signature Extraction

Ida Egendal; Rasmus Froberg Br{\o}ndum; Marta Pelizzola; Asger Hobolth; and Martin B{\o}gsted

arXiv:2405.07879·stat.AP·May 14, 2024

On the Relation Between Autoencoders and Non-negative Matrix Factorization, and Their Application for Mutational Signature Extraction

Ida Egendal, Rasmus Froberg Br{\o}ndum, Marta Pelizzola, Asger Hobolth, and Martin B{\o}gsted

PDF

Open Access

TL;DR

This study explores the theoretical relationship between non-negative autoencoders and NMF, revealing that convex NMF is a special case of autoencoders, and compares their effectiveness in mutational signature extraction from cancer genomics data.

Contribution

It establishes a formal connection between non-negative autoencoders and NMF, specifically through convex NMF, and evaluates their performance in a practical genomics application.

Findings

01

NMF provides more accurate reconstructions than autoencoders.

02

Both methods produce comparable mutational signatures.

03

Autoencoders do not outperform NMF in mutational signature extraction.

Abstract

The aim of this study is to provide a foundation to understand the relationship between non-negative matrix factorization (NMF) and non-negative autoencoders enabling proper interpretation and understanding of autoencoder-based alternatives to NMF. Since its introduction, NMF has been a popular tool for extracting interpretable, low-dimensional representations of high-dimensional data. However, recently, several studies have proposed to replace NMF with autoencoders. This increasing popularity of autoencoders warrants an investigation on whether this replacement is in general valid and reasonable. Moreover, the exact relationship between non-negative autoencoders and NMF has not been thoroughly explored. Thus, a main aim of this study is to investigate in detail the relationship between non-negative autoencoders and NMF. We find that the connection between the two models can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Algorithms and Data Compression