Minimal Algorithmic Information Loss Methods for Dimension Reduction, Feature Selection and Network Sparsification
Hector Zenil, Narsis A. Kiani, Alyssa Adams, Felipe S. Abrah\~ao,, Antonio Rueda-Toicen, Allan A. Zea, Luan Ozelim, Jesper Tegn\'er

TL;DR
This paper introduces a novel, domain-agnostic algorithmic complexity-based method for dimension reduction and network sparsification that outperforms traditional statistical techniques in preserving essential features and patterns.
Contribution
The authors develop a universal, unsupervised approach based on algorithmic complexity principles that improves feature preservation and reduces reliance on prior feature selection.
Findings
Outperforms classical statistical and state-of-the-art algorithms
Preserves nonlinear patterns and deterministic recursive features
Effective in lossy compression of images and multi-dimensional data
Abstract
We present a novel, domain-agnostic, model-independent, unsupervised, and universally applicable Machine Learning approach for dimensionality reduction based on the principles of algorithmic complexity. Specifically, but without loss of generality, we focus on addressing the challenge of reducing certain dimensionality aspects, such as the number of edges in a network, while retaining essential features of interest. These features include preserving crucial network properties like degree distribution, clustering coefficient, edge betweenness, and degree and eigenvector centralities but can also go beyond edges to nodes and weights for network pruning and trimming. Our approach outperforms classical statistical Machine Learning techniques and state-of-the-art dimensionality reduction algorithms by preserving a greater number of data features that statistical algorithms would miss,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Statistical Methods and Inference · Neural Networks and Applications
