Exhaustive search for sparse variable selection in linear regression
Yasuhiko Igarashi, Hikaru Takenaka, Yoshinori Nakanishi-Ohno, Makoto, Uemura, Shiro Ikeda, Masato Okada

TL;DR
This paper introduces exhaustive and approximate methods for selecting sparse variables in linear regression, enabling detailed analysis of variable combinations and comparison of selection techniques, especially useful for large datasets.
Contribution
It proposes the ES-K and AES-K methods for exhaustive and approximate sparse variable selection, and introduces density of states as a new analytical tool for comparing selection methods.
Findings
Confirmed conventional understanding in astronomy with appropriate K.
Identified difficulty in determining K from data.
Attributed data shortage as a cause for selection challenges.
Abstract
We propose a K-sparse exhaustive search (ES-K) method and a K-sparse approximate exhaustive search method (AES-K) for selecting variables in linear regression. With these methods, K-sparse combinations of variables are tested exhaustively assuming that the optimal combination of explanatory variables is K-sparse. By collecting the results of exhaustively computing ES-K, various approximate methods for selecting sparse variables can be summarized as density of states. With this density of states, we can compare different methods for selecting sparse variables such as relaxation and sampling. For large problems where the combinatorial explosion of explanatory variables is crucial, the AES-K method enables density of states to be effectively reconstructed by using the replica-exchange Monte Carlo method and the multiple histogram method. Applying the ES-K and AES-K methods to type Ia…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
