LassoFlexNet: Flexible Neural Architecture for Tabular Data
Kry Yik Chau Lui, Cheng Chi, Kishore Basu, Yanshuai Cao

TL;DR
LassoFlexNet is a novel neural architecture that incorporates inductive biases and variable selection mechanisms to improve deep learning performance on tabular data, matching or surpassing tree-based models.
Contribution
It introduces a flexible neural architecture with a Tied Group Lasso mechanism and a new optimizer, enhancing interpretability and stability for tabular data modeling.
Findings
Outperforms leading tree-based models on 52 datasets with up to 10% relative gain.
Achieves Lasso-like interpretability while maintaining high predictive accuracy.
Theoretical proofs confirm increased expressivity and breaking of rotational invariance.
Abstract
Despite their dominance in vision and language, deep neural networks often underperform relative to tree-based models on tabular data. To bridge this gap, we incorporate five key inductive biases into deep learning: robustness to irrelevant features, axis alignment, localized irregularities, feature heterogeneity, and training stability. We propose \emph{LassoFlexNet}, an architecture that evaluates the linear and nonlinear marginal contribution of each input via Per-Feature Embeddings, and sparsely selects relevant variables using a Tied Group Lasso mechanism. Because these components introduce optimization challenges that destabilize standard proximal methods, we develop a \emph{Sequential Hierarchical Proximal Adaptive Gradient optimizer with exponential moving averages (EMA)} to ensure stable convergence. Across datasets from three benchmarks, LassoFlexNet matches or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications
