Data-driven configuration tuning of glmnet for balancing accuracy and computational efficiency

Shuhei Muroya; Kei Hirose

arXiv:2602.17922·stat.CO·March 18, 2026

Data-driven configuration tuning of glmnet for balancing accuracy and computational efficiency

Shuhei Muroya, Kei Hirose

PDF

Open Access

TL;DR

This paper introduces a data-driven framework using neural networks to optimize glmnet configuration, balancing solution accuracy and computational efficiency, and enabling automatic selection based on user constraints.

Contribution

It presents a systematic, neural network-based approach to tune glmnet configurations for optimal tradeoffs between accuracy and speed, implemented in an R package.

Findings

01

Neural networks accurately predict glmnet accuracy and computation time.

02

The framework effectively generates Pareto fronts for configuration tradeoffs.

03

Automatic configuration selection improves solution quality within time constraints.

Abstract

The glmnet package in R is widely used for lasso estimation because of its computational efficiency. Despite its popularity, glmnet occasionally yields solutions that deviate substantially from the true ones because of the inappropriate default configuration of the algorithm. The accuracy of the obtained solutions can be improved by appropriately tuning the configuration. However, such improvements typically increase computational time, resulting in a tradeoff between accuracy and computational efficiency. Therefore, a systematic approach is required to determine the appropriate configuration. To address this need, we propose a unified data-driven framework specifically designed to optimize the configuration by balancing solution path accuracy and computational cost. Specifically, we generate a large-scale training dataset by measuring the accuracy and computation time of glmnet. Using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Data Analysis with R · Machine Learning and Data Classification