Minimal Algorithmic Information Loss Methods for Dimension Reduction,   Feature Selection and Network Sparsification

Hector Zenil; Narsis A. Kiani; Alyssa Adams; Felipe S. Abrah\~ao,; Antonio Rueda-Toicen; Allan A. Zea; Luan Ozelim; Jesper Tegn\'er

arXiv:1802.05843·cs.DS·May 6, 2025·5 cites

Minimal Algorithmic Information Loss Methods for Dimension Reduction, Feature Selection and Network Sparsification

Hector Zenil, Narsis A. Kiani, Alyssa Adams, Felipe S. Abrah\~ao,, Antonio Rueda-Toicen, Allan A. Zea, Luan Ozelim, Jesper Tegn\'er

PDF

Open Access 2 Repos

TL;DR

This paper introduces a novel, domain-agnostic algorithmic complexity-based method for dimension reduction and network sparsification that outperforms traditional statistical techniques in preserving essential features and patterns.

Contribution

The authors develop a universal, unsupervised approach based on algorithmic complexity principles that improves feature preservation and reduces reliance on prior feature selection.

Findings

01

Outperforms classical statistical and state-of-the-art algorithms

02

Preserves nonlinear patterns and deterministic recursive features

03

Effective in lossy compression of images and multi-dimensional data

Abstract

We present a novel, domain-agnostic, model-independent, unsupervised, and universally applicable Machine Learning approach for dimensionality reduction based on the principles of algorithmic complexity. Specifically, but without loss of generality, we focus on addressing the challenge of reducing certain dimensionality aspects, such as the number of edges in a network, while retaining essential features of interest. These features include preserving crucial network properties like degree distribution, clustering coefficient, edge betweenness, and degree and eigenvector centralities but can also go beyond edges to nodes and weights for network pruning and trimming. Our approach outperforms classical statistical Machine Learning techniques and state-of-the-art dimensionality reduction algorithms by preserving a greater number of data features that statistical algorithms would miss,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Statistical Methods and Inference · Neural Networks and Applications