Diffusion-Guided Feature Selection via Nishimori Temperature: Noise-Based Spectral Embedding
Vasiliy S. Usatyuk, Denis A. Sapozhnikov, Sergey I. Egorov

TL;DR
This paper introduces Noise-Based Spectral Embedding (NBSE), a physics-inspired method for feature selection that maintains high classification accuracy under aggressive data compression.
Contribution
NBSE leverages Nishimori temperature and spectral analysis to identify and select informative features without greedy search, improving robustness and efficiency.
Findings
NBSE preserves classification accuracy with only 30% features on ImageNet embeddings.
It outperforms ANOVA F-test and random selection by up to 6.8%.
The method is robust to measurement noise, with limited shift in Nishimori temperature.
Abstract
We propose Noise-Based Spectral Embedding (NBSE), a physics-informed framework for selecting informative features from high-dimensional data without greedy search. NBSE constructs a sparse similarity graph on the samples and identifies the Nishimori temperature the critical inverse temperature at which the Bethe Hessian becomes singular. The corresponding smallest eigenvector captures the dominant mode of an intrinsically degree-corrected diffusion process, naturally reweighting nodes to prevent hub dominance. By transposing the data matrix and applying NBSE in feature space, we obtain a one-dimensional spectral embedding that reveals groups of redundant or semantically related dimensions; balanced binning then selects one representative per group. We prove that coloured Gaussian perturbations shift by at most , guaranteeing robustness to measurement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
