A lasso for hierarchical interactions

Jacob Bien; Jonathan Taylor; Robert Tibshirani

arXiv:1205.5050·stat.ME·June 20, 2013

A lasso for hierarchical interactions

Jacob Bien, Jonathan Taylor, Robert Tibshirani

PDF

TL;DR

This paper introduces a convex constrained lasso method that enforces hierarchical structure in interaction models, improving interpretability and data collection efficiency.

Contribution

It provides a novel hierarchical lasso formulation, theoretical analysis, and an algorithm implemented in R for practical sparse interaction modeling.

Findings

01

Hierarchy constraint holds with probability one.

02

Unbiased degrees of freedom estimate derived.

03

Empirical study demonstrates effectiveness.

Abstract

We add a set of convex constraints to the lasso to produce sparse interaction models that honor the hierarchy restriction that an interaction only be included in a model if one or both variables are marginally important. We give a precise characterization of the effect of this hierarchy constraint, prove that hierarchy holds with probability one and derive an unbiased estimate for the degrees of freedom of our estimator. A bound on this estimate reveals the amount of fitting "saved" by the hierarchy constraint. We distinguish between parameter sparsity - the number of nonzero coefficients - and practical sparsity - the number of raw variables one must measure to make a new prediction. Hierarchy focuses on the latter, which is more closely tied to important data collection concerns such as cost, time and effort. We develop an algorithm, available in the R package hierNet, and perform an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.