On the Relation Between Autoencoders and Non-negative Matrix Factorization, and Their Application for Mutational Signature Extraction
Ida Egendal, Rasmus Froberg Br{\o}ndum, Marta Pelizzola, Asger Hobolth, and Martin B{\o}gsted

TL;DR
This study explores the theoretical relationship between non-negative autoencoders and NMF, revealing that convex NMF is a special case of autoencoders, and compares their effectiveness in mutational signature extraction from cancer genomics data.
Contribution
It establishes a formal connection between non-negative autoencoders and NMF, specifically through convex NMF, and evaluates their performance in a practical genomics application.
Findings
NMF provides more accurate reconstructions than autoencoders.
Both methods produce comparable mutational signatures.
Autoencoders do not outperform NMF in mutational signature extraction.
Abstract
The aim of this study is to provide a foundation to understand the relationship between non-negative matrix factorization (NMF) and non-negative autoencoders enabling proper interpretation and understanding of autoencoder-based alternatives to NMF. Since its introduction, NMF has been a popular tool for extracting interpretable, low-dimensional representations of high-dimensional data. However, recently, several studies have proposed to replace NMF with autoencoders. This increasing popularity of autoencoders warrants an investigation on whether this replacement is in general valid and reasonable. Moreover, the exact relationship between non-negative autoencoders and NMF has not been thoroughly explored. Thus, a main aim of this study is to investigate in detail the relationship between non-negative autoencoders and NMF. We find that the connection between the two models can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Algorithms and Data Compression
