Getting more from your regression model: A free lunch?

David P. Hofmeyr

arXiv:2203.10459·stat.ME·March 22, 2022

Getting more from your regression model: A free lunch?

David P. Hofmeyr

PDF

Open Access

TL;DR

This paper introduces a simple method to extract detailed conditional distribution information from standard regression models, showing it performs well for quantile estimation and rivals specialized methods across various datasets.

Contribution

The authors propose a straightforward approach to approximate conditional distributions from standard regression outputs, enhancing quantile regression without additional complex modeling.

Findings

01

Method performs well for tail probability estimation

02

Comparable to quantile regression forest models

03

Effective across diverse benchmark datasets

Abstract

We consider a simple approach for approximating detailed information about the conditional distribution of a real-valued response variable, given values for its covariates, using only the outputs from a standard regression model. We validate this approach by assessing its performance in the context of quantile regression; when applied to the outputs of linear, gradient boosted tree ensemble and random forest models. We find that it compares favourably to the standard approach for estimating quantile regression functions, especially for commonly selected tail probabilities, and is highly competitive with the quantile regression forest model, across a large collection of benchmark data sets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Gaussian Processes and Bayesian Inference · Bayesian Modeling and Causal Inference