On Universal Features for High-Dimensional Learning and Inference
Shao-Lun Huang, Anuran Makur, Gregory W. Wornell, and Lizhong Zheng

TL;DR
This paper introduces a unified information geometric framework for identifying universal low-dimensional features in high-dimensional data, enhancing understanding and optimization of various learning systems.
Contribution
It develops a novel theoretical framework connecting multiple classical and modern analysis tools for high-dimensional feature extraction and inference.
Findings
Unified geometric framework for feature extraction
Connections among SVD, CCA, information bottleneck, and more
Applications to neural networks, matrix factorization, and semi-supervised learning
Abstract
We consider the problem of identifying universal low-dimensional features from high-dimensional data for inference tasks in settings involving learning. For such problems, we introduce natural notions of universality and we show a local equivalence among them. Our analysis is naturally expressed via information geometry, and represents a conceptually and computationally useful analysis. The development reveals the complementary roles of the singular value decomposition, Hirschfeld-Gebelein-R\'enyi maximal correlation, the canonical correlation and principle component analyses of Hotelling and Pearson, Tishby's information bottleneck, Wyner's common information, Ky Fan -norms, and Brieman and Friedman's alternating conditional expectations algorithm. We further illustrate how this framework facilitates understanding and optimizing aspects of learning systems, including multinomial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Statistical Mechanics and Entropy · Machine Learning and Algorithms
