Data-driven configuration tuning of glmnet for balancing accuracy and computational efficiency
Shuhei Muroya, Kei Hirose

TL;DR
This paper introduces a data-driven framework using neural networks to optimize glmnet configuration, balancing solution accuracy and computational efficiency, and enabling automatic selection based on user constraints.
Contribution
It presents a systematic, neural network-based approach to tune glmnet configurations for optimal tradeoffs between accuracy and speed, implemented in an R package.
Findings
Neural networks accurately predict glmnet accuracy and computation time.
The framework effectively generates Pareto fronts for configuration tradeoffs.
Automatic configuration selection improves solution quality within time constraints.
Abstract
The glmnet package in R is widely used for lasso estimation because of its computational efficiency. Despite its popularity, glmnet occasionally yields solutions that deviate substantially from the true ones because of the inappropriate default configuration of the algorithm. The accuracy of the obtained solutions can be improved by appropriately tuning the configuration. However, such improvements typically increase computational time, resulting in a tradeoff between accuracy and computational efficiency. Therefore, a systematic approach is required to determine the appropriate configuration. To address this need, we propose a unified data-driven framework specifically designed to optimize the configuration by balancing solution path accuracy and computational cost. Specifically, we generate a large-scale training dataset by measuring the accuracy and computation time of glmnet. Using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Data Analysis with R · Machine Learning and Data Classification
