Model-Agnostic and Uncertainty-Aware Dimensionality Reduction in Supervised Learning

Yue Yu; Guanghui Wang; Liu Liu; Changliang Zou

arXiv:2601.10357·stat.ME·January 16, 2026

Model-Agnostic and Uncertainty-Aware Dimensionality Reduction in Supervised Learning

Yue Yu, Guanghui Wang, Liu Liu, Changliang Zou

PDF

Open Access

TL;DR

This paper presents POD, a flexible, model-agnostic framework for dimension reduction in supervised learning that directly assesses predictive utility and quantifies uncertainty, improving the selection of intrinsic data dimensions.

Contribution

Introduces POD, a novel, predictive performance-based, model-agnostic approach for determining the minimal sufficient data dimension with uncertainty quantification.

Findings

01

POD accurately estimates data dimensions in simulations.

02

POD provides reliable uncertainty bounds for order estimation.

03

POD outperforms traditional methods in real-data applications.

Abstract

Dimension reduction is a fundamental tool for analyzing high-dimensional data in supervised learning. Traditional methods for estimating intrinsic order often prioritize model-specific structural assumptions over predictive utility. This paper introduces predictive order determination (POD), a model-agnostic framework that determines the minimal predictively sufficient dimension by directly evaluating out-of-sample predictiveness. POD quantifies uncertainty via error bounds for over- and underestimation and achieves consistency under mild conditions. By unifying dimension reduction with predictive performance, POD applies flexibly across diverse reduction tasks and supervised learners. Simulations and real-data analyses show that POD delivers accurate, uncertainty-aware order estimates, making it a versatile component for prediction-centric pipelines.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis · Machine Learning and ELM