ABM: an automatic supervised feature engineering method for loss based models based on group and fused lasso
Weijian Luo, Yongxian Long

TL;DR
ABM is an innovative supervised feature engineering method that automatically selects optimal binning points for variables, integrating feature engineering, variable selection, and model training, applicable to various loss-based models including neural networks.
Contribution
The paper introduces ABM, a novel end-to-end supervised binning method based on group and fused lasso, enhancing feature engineering and variable selection simultaneously.
Findings
Effective automatic binning improves model performance.
Applicable to diverse loss-based models including deep neural networks.
Open-source implementation available in R and Python.
Abstract
A vital problem in solving classification or regression problem is to apply feature engineering and variable selection on data before fed into models.One of a most popular feature engineering method is to discretisize continous variable with some cutting points,which is refered to as bining processing.Good cutting points are important for improving model's ability, because wonderful bining may ignore some noisy variance in continous variable range and keep useful leveled information with good ordered encodings.However, to our best knowledge a majority of cutting point selection is done via researchers domain knownledge or some naive methods like equal-width cutting or equal-frequency cutting.In this paper we propose an end-to-end supervised cutting point selection method based on group and fused lasso along with the automatically variable selection effect.We name our method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Statistical Methods and Inference · Face and Expression Recognition
