Finding Rule-Interpretable Non-Negative Data Representation

Matej Mihel\v{c}i\'c; Pauli Miettinen

arXiv:2206.01483·cs.LG·February 4, 2025

Finding Rule-Interpretable Non-Negative Data Representation

Matej Mihel\v{c}i\'c, Pauli Miettinen

PDF

Open Access

TL;DR

This paper introduces a novel NMF-based method that integrates rule-based descriptions to produce interpretable, part-based, lower-dimensional representations of non-negative data, enhancing interpretability and focus on specific labels.

Contribution

The proposed approach merges rule-based descriptions with NMF to generate interpretable, rule-described latent factors in non-negative data, improving interpretability over traditional NMF.

Findings

01

Creates rule-described non-negative data representations

02

Reveals attribute importance and interactions

03

Enables focused embedding with multiple labels

Abstract

Non-negative Matrix Factorization (NMF) is an intensively used technique for obtaining parts-based, lower dimensional and non-negative representation. Researchers in biology, medicine, pharmacy and other fields often prefer NMF over other dimensionality reduction approaches (such as PCA) because the non-negativity of the approach naturally fits the characteristics of the domain problem and its results are easier to analyze and understand. Despite these advantages, obtaining exact characterization and interpretation of the NMF's latent factors can still be difficult due to their numerical nature. Rule-based approaches, such as rule mining, conceptual clustering, subgroup discovery and redescription mining, are often considered more interpretable but lack lower-dimensional representation of the data. We present a version of the NMF approach that merges rule-based descriptions with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGene expression and cancer classification · Biomedical Text Mining and Ontologies · Machine Learning in Bioinformatics