From Uniform to Learned Knots: A Study of Spline-Based Numerical Encodings for Tabular Deep Learning
Manish Kumar, Anton Frederik Thielmann, Christoph Weisser, Benjamin S\"afken

TL;DR
This study explores spline-based numerical encodings for tabular deep learning, analyzing their impact on model performance across different tasks, backbones, and encoding strategies, including learnable knots.
Contribution
It systematically evaluates various spline families and knot placement strategies, introducing a differentiable parameterization for learnable knots in numerical encodings.
Findings
Piecewise-linear encoding (PLE) is most robust for classification.
Spline-based encodings are competitive but task-dependent.
Learnable knots can be optimized but increase training cost.
Abstract
Numerical preprocessing remains an important component of tabular deep learning, where the representation of continuous features can strongly affect downstream performance. Although its importance is well established for classical statistical and machine learning models, the role of explicit numerical preprocessing in tabular deep learning remains less well understood. In this work, we study this question with a focus on spline-based numerical encodings. We investigate three spline families for encoding numerical features, namely B-splines, M-splines, and integrated splines (I-splines), under uniform, quantile-based, target-aware, and learnable-knot placement. For the learnable-knot variants, we use a differentiable knot parameterization that enables stable end-to-end optimization of knot locations jointly with the backbone. We evaluate these encodings on a diverse collection of public…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
