DISCOMAX: A Proximity-Preserving Distance Correlation Maximization Algorithm
Praneeth Vepakomma, Ahmed Elgammal

TL;DR
This paper introduces DISCOMAX, an algorithm that learns low-dimensional features by maximizing distance correlation with the response, aiding prediction without relying on specific regression models.
Contribution
It proposes a novel, model-free feature learning algorithm that maximizes dependence measures, combining majorization-minimization and concave-convex optimization techniques.
Findings
Algorithm effectively learns low-dimensional features.
Convergence results established via spectral radius analysis.
Enhances prediction accuracy with reduced features.
Abstract
In a regression setting we propose algorithms that reduce the dimensionality of the features while simultaneously maximizing a statistical measure of dependence known as distance correlation between the low-dimensional features and a response variable. This helps in solving the prediction problem with a low-dimensional set of features. Our setting is different from subset-selection algorithms where the problem is to choose the best subset of features for regression. Instead, we attempt to generate a new set of low-dimensional features as in a feature-learning setting. We attempt to keep our proposed approach as model-free and our algorithm does not assume the application of any specific regression model in conjunction with the low-dimensional features that it learns. The algorithm is iterative and is fomulated as a combination of the majorization-minimization and concave-convex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Bayesian Methods and Mixture Models · Data Management and Algorithms
