Additive Approximations in High Dimensional Nonparametric Regression via the SALSA
Kirthevasan Kandasamy, Yaoliang Yu

TL;DR
SALSA introduces a middle-ground additive model for high-dimensional nonparametric regression, balancing bias and variance by controlling interaction order, and demonstrates competitive performance on real datasets.
Contribution
The paper proposes SALSA, an innovative method that allows variable interactions of limited order, bridging the gap between simple additive models and complex non-additive models.
Findings
Excess risk is polynomial in dimension for additive functions.
Efficient summation over combinatorial terms using Girard-Newton formulae.
Competitive performance on 15 real datasets against 21 methods.
Abstract
High dimensional nonparametric regression is an inherently difficult problem with known lower bounds depending exponentially in dimension. A popular strategy to alleviate this curse of dimensionality has been to use additive models of \emph{first order}, which model the regression function as a sum of independent functions on each dimension. Though useful in controlling the variance of the estimate, such models are often too restrictive in practical settings. Between non-additive models which often have large variance and first order additive models which have large bias, there has been little work to exploit the trade-off in the middle via additive models of intermediate order. In this work, we propose SALSA, which bridges this gap by allowing interactions between variables, but controls model capacity by limiting the order of interactions. SALSA minimises the residual sum of squares…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Statistical Methods and Inference
